Compositions comprising isolated Helicobacter pylori CagI polynucleotides and method of preparation thereof

ABSTRACT

Helicobacter pylori is known to cause or be a cofactor in type B gastritis, peptic ulcers, and gastric tumors. In both developed and developing countries, a high percentage of people are infected with this bacterium. The present invention relates generally to a certain H. pylori region located 5&#39; to the CagA gene locus, to proteins encoded thereby, and to the use of these genes and proteins for diagnostic and vaccine applications.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of U.S. Ser. No. 08/425,194 (attorney docket no. 00335.001), filed Apr. 20, 1995 now abandoned. This application is also a continuation-in-part of Covacci et al. copending application Ser. No. 08/471,491, filed Jun. 6, 1995, which is a divisional of application Ser. No. 08/256,848, filed Oct. 21, 1994 now abandoned, which is a U.S. national stage application of, PCT/EP93/00472, filed Mar. 2, 1993 and PCT/EP93/00158, filed Jan. 25, 1993, which PCT applications claimed priority benefit under 35 USC §119 of Italian application Serial No. FI92 A 000052, filed Mar. 2, 1992.

BACKGROUND OF THE INVENTION

1. Field of the Disclosure

The present invention relates generally to certain Helicobacter pylori genetic regions, to the proteins expressed by these regions, and to the use of these genes and proteins for diagnostic and vaccine applications.

2. Brief Description of Related Art

Helicobacter pylori is a curved, microaerophilic, gram negative bacterium that has been isolated for the first time in 1982 from stomach biopsies of patients with chronic gastritis, Warren et al., Lancet i: 1273-75 (1983). Originally named Campylobacter pylori, it has been recognized to be part of a separate genus named Helicobacter, Goodwin et al., Int. J. Syst. Bacteriol. 39: 397-405 (1989). The bacterium colonizes the human gastric mucosa, and infection can persist for decades. During the last few years, the presence of the bacterium has been associated with chronic gastritis type B, a condition that may remain asymptomatic in most infected persons but increases considerably the risk of peptic ulcer and gastric adenocarcinoma. The most recent studies strongly suggest that H. pylori infection may be either a cause or a cofactor of type B gastritis, peptic ulcers, and gastric tumors, see e.g., Blaser, Gastroenterology 93: 371-83 (1987); Dooley et al., New Engl. J. Med. 321: 1562-66 (1989); Parsonnet et al., New Engl. J. Med. 325: 1127-31 (1991). H. pylori is believed to be transmitted by the oral route, Thomas et al., Lancet i: 340, 1194 (1992), and the risk of infection increases with age, Graham et al., Gastroenterology 100: 1495-1501 (1991), and is facilitated by crowding, Drumm et al., New Engl. J. Med. 4322: 359-63 (1990); Blaser, Clin. Infect. Dis. 15: 386-93 (1992). In developed countries, the presence of antibodies against H. pylori antigens increases from less than 20% to over 50% in people 30 and 60 years old respectively, Jones et al., Med. Microbio. 22: 57-62 (1986); Morris et al., N. Z. Med. J. 99: 657-59 (1986), while in developing countries over 80% of the population are already infected by the age of 20, Graham et al., Digestive Diseases and Sciences 36: 1084-88 (1991).

H. pylori factors that have been identified so far include the flagella that are probably necessary to move across the mucus layer, see e.g., Leying et al., Mol. Microbiol. 6: 2863-74 (1992); the urease that is necessary to neutralize the acidic environment of the stomach and to allow initial colonization, see e.g., Cussac et al., J. Bacteriol. 174: 2466-73 (1992), Perez-Perez et al., J. Infect. Immun. 60: 3658-3663 (1992), Austin et al., J. Bacteriol. 174: 7470-73 (1992), PCT Publ. No. WO 90/04030; the H. pylori cytotoxin (sometimes referred to as VacA, as it causes vacuolation), see e.g., PCT Publ. No. WO 93/18150, Telford, J. L. et al., J. Exp. Med. 179: 1653-58 (1994), Cover et al., J. Bio. Chem. 267: 10570-75 (1992), Cover et al., J. Clin. Invest. 90: 913-18 (1992), Leunk, Rev. Infect. Dis. 13: 5686-89 (1991); the H. pylori heat shock protein, see e.g., PCT Publ. No. WO 93/18150, Evans et al., Infect. Immun. 60: 2125-27 (1992), Dunn et al., Infect. Immun. 60: 1946-51 (1992), Austin et al., J. Bacteriol. 174: 7470-73 (1992); and the cytotoxin-associated protein, CagA, see e.g., PCT Publ. No. WO 93/18150, Covacci, A., et al., Proc. Natl. Acad. Sci. USA 90: 5791-95 (1993), Tummuru, M. K. et al., Infect. Immun. 61: 1799-1809 (1994).

Currently, H. pylori strains can be partitioned into at least two major groups, which either express (Type I) or do not express (Type II) the cytotoxin and the CagA proteins. Type I strains contain the CagA and toxin genes and produce active forms of these antigens. Type II strains lack the CagA locus and fail to express the cytotoxin. The association between the presence of the CagA gene and cytotoxicity suggests that the product of the CagA gene is necessary for the transcription, folding, export or function of the cytotoxin. Epidemiological analysis indicate that Type I bacteria are associated with duodenal ulcerations, gastric ulceration and sever forms of active gastritis.

For a general review of the pathogenic role of H. pylori in peptic ulcer, see Telford, J. L., et al., TibTech 12: 420-426 (1994).

SUMMARY OF THE INVENTION

The present invention describes nucleotide sequences located at the 5' region of the CagA gene. It has been found that the absence of the CagA gene in Type I strains is associated with the absence of genetic sequences located 5' of the CagA locus. This general region has been designated CagI and may encode virulence factors restricted to Type I strains. Sequences from this region were able to recognize homologous sequences in all Type I strains but failed to hybridize with DNA from Type II strains. Thus, this region has important implications for diagnosis of pathogenic Type I bacteria.

The present invention pertains not only to this CagI region, but also to recombinant materials associated therewith, such as vectors, host cells, and proteins encoded by such regions. The understanding at the molecular level of the nature and the role of this region and the availability of recombinant production has important implications for the development of new diagnostics for H. pylori and for the design of vaccines that may prevent H. pylori infection and treat disease. As such, this region has applications for diagnostics and vaccines. The present invention includes methods for diagnosing those diseases associated with H. pylori. As H. pylori has been associated with type B gastritis, peptic ulcers, and gastric adenocarcinoma, it is hoped that the present invention will assist in early detection and alleviation of these disease states.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic diagram of the H. pylori CagI region showing location of specific clones.

FIG. 2 is the nucleotide (SEQ ID NOS 1 and 5 (for FIG. 2, Seq 1), SEQ ID NOS 9 and 13 (for FIG. 2, Seq 2) and SEQ ID NOS 17 and 21 ( for FIG. 2, Seq. 3) and predicted amino acid sequences (SEQ ID NOS 2-4 and 6-8 (for FIG. 2, Seq 1), SEQ ID NOS 10-12 and 14-16 (for FIG. 2, Seq 2), and SEQ ID NOS 18-20 and 22-24 (for FIG. 2, Seq. 3), as well as restriction enzyme sites, for the H. pylori CagI locus, as represented by three sequence segments (see below).

FIGS. 3A through 3R is the complete nucleotide sequence for the H. pylori CagI locus (19,932 base pairs) (SEQ ID NO 25).

FIGS. 4A through 4N are the putative open reading frames (ORF) for the CagI locus and their possible homology with other proteins (SEQ ID NO 26-46).

DETAILED DESCRIPTION OF THE INVENTION

A. General Methodology

The practice of the present invention will employ, unless otherwise indicated, conventional techniques of molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. Such techniques are explained fully in the literature. See e.g., Sambrook, et al., MOLECULAR CLONING; A LABORATORY MANUAL, SECOND EDITION (1989); DNA CLONING, VOLUMES I AND II (D. N Glover ed. 1985); OLIGONUCLEOTIDE SYNTHESIS (M. J. Gait ed, 1984); NUCLEIC ACID HYBRIDIZATION (B. D. Hames & S. J. Higgins eds. 1984); TRANSCRIPTION AND TRANSLATION (B. D. Hames & S. J. Higgins eds. 1984); ANIMAL CELL CULTURE (R. I. Freshney ed. 1986); IMMOBILIZED CELLS AND ENZYMES (IRL Press, 1986); B. Perbal, A PRACTICAL GUIDE TO MOLECULAR CLONING (1984); the series, METHODS IN ENZYMOLOGY (Academic Press, Inc.); GENE TRANSFER VECTORS FOR MAMMALIAN CELLS (J. H. Miller and M. P. Calos eds. 1987, Cold Spring Harbor Laboratory), Methods in Enzymology Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, eds., respectively), Mayer and Walker, eds. (1987), IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press, London), Scopes, (1987), PROTEIN PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition (Springer-Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL IMMUNOLOGY, VOLUMES I-IV (D. M. Weir and C. C. Blackwell eds 1986).

Standard abbreviations for nucleotides and amino acids are used in this specification. All publications, patents, and patent applications cited herein are incorporated by reference.

B. Definitions

"CagI" refers to the genetic regions located 5' to the CagA locus, whose absence correlates with the absence of the CagA gene in Type II bacterial strains. The nucleotide and putative amino acid sequences of CagI are shown in FIGS. 2, 3, and 4(SEQ ID NOS 1-46).

Examples of proteins encoded by the CagI region that can be used in the present invention include polypeptides with minor amino acid variations from the natural amino acid sequence of the protein; in particular, conservative amino acid replacements are contemplated. Conservative replacements are those that take place within a family of amino acids that are related in their side chains. Genetically encoded amino acids are generally divided into four families: (1) acidic=aspartate, glutamate; (2) basic=lysine, arginine, histidine; (3) non-polar=alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and (4) uncharged polar=glycine, asparagine, glutamine, cystine, serine, threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes classified jointly as aromatic amino acids. For example, it is reasonably predictable that an isolated replacement of a leucine with an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar conservative replacement of an amino acid with a structurally related amino acid will not have a major effect on the biological activity. Polypeptide molecules having substantially the same amino acid sequence as the protein but possessing minor amino acid substitutions that do not substantially affect the functional aspects are within the definition of the protein.

A significant advantage of producing the protein by recombinant DNA techniques rather than by isolating and purifying a protein from natural sources is that equivalent quantities of the protein can be produced by using less starting material than would be required for isolating the protein from a natural source. Producing the protein by recombinant techniques also permits the protein to be isolated in the absence of some molecules normally present in cells. Indeed, protein compositions entirely free of any trace of human protein contaminants can readily be produced because the only human protein produced by the recombinant non-human host is the recombinant protein at issue. Potential viral agents from natural sources and viral components pathogenic to humans are also avoided.

The term "recombinant polynucleotide" as used herein intends a polynucleotide of genomic, cDNA, semisynthetic, or synthetic origin which, by virtue of its origin or manipulation: (1) is not associated with all or a portion of a polynucleotide with which it is associated in nature, (2) is linked to a polynucleotide other than that to which it is linked in nature, or (3) does not occur in nature. Thus, this term also encompasses the situation wherein the H. pylori bacterium genome is genetically modified (e.g., through mutagenesis) to produce one or more altered polypeptides.

The term "polynucleotide" as used herein refers to a polymeric form of a nucleotide of any length, preferably deoxyribonucleotides, and is used interchangeably herein with the terms "oligonucleotide" and "oligomer." The term refers only to the primary structure of the molecule. Thus, this term includes double- and single-stranded DNA, as well as antisense polynucleotides. It also includes known types of modifications, for example, the presence of labels which are known in the art, methylation, end "caps," substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications such as, for example, replacement with certain types of uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates, carbamates, etc.) or charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), introduction of pendant moieties, such as, for example, proteins (including nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), intercalators (e.g., acridine, psoralen, etc.), chelators (e.g., metals, radioactive species, boron, oxidative moieties, etc.), alkylators (e.g., alpha anomeric nucleic acids, etc.).

By "genomic" is meant a collection or library of DNA molecules which are derived from restriction fragments that have been cloned in vectors. This may include all or part of the genetic material of an organism.

By "cDNA" is meant a complimentary mRNA sequence that hybridizes to a complimentary strand of mRNA.

As used herein, the term "oligomer" refers to both primers and probes and is used interchangeably herein with the term "polynucleotide." The term oligomer does not connote the size of the molecule. However, typically oligomers are no greater than 1000 nucleotides, more typically are no greater than 500 nucleotides, even more typically are no greater than 250 nucleotides; they may be no greater than 100 nucleotides, and may be no greater than 75 nucleotides, and also may be no greater than 50 nucleotides in length.

The term "primer" as used herein refers to an oligomer which is capable of acting as a point of initiation of synthesis of a polynucleotide strand when used under appropriate conditions. The primer will be completely or substantially complementary to a region of the polynucleotide strand to be copied. Thus, under conditions conducive to hybridization, the primer will anneal to the complementary region of the analyte strand. Upon addition of suitable reactants, (e.g., a polymerase, nucleotide triphosphates, and the like), the primer will be extended by the polymerizing agent to form a copy of the analyte strand. The primer may be single-stranded or alternatively may be partially or fully double-stranded.

The terms "analyte polynucleotide" and "analyte strand" refer to a single- or double-stranded nucleic acid molecule which is suspected of containing a target sequence, and which may be present in a biological sample.

As used herein, the term "probe" refers to a structure comprised of a polynucleotide which forms a hybrid structure with a target sequence, due to complementarily of at least one sequence in the probe with a sequence in the target region. The polynucleotide regions of probes may be composed of DNA, and/or RNA, and/or synthetic nucleotide analogs. Included within probes are "capture probes" and "label probes".

As used herein, the term "target region" refers to a region of the nucleic acid which is to be amplified and/or detected. The term "target sequence" refers to a sequence with which a probe or primer will form a stable hybrid under desired conditions.

The term "capture probe" as used herein refers to a polynucleotide probe comprised of a single-stranded polynucleotide coupled to a binding partner. The single-stranded polynucleotide is comprised of a targeting polynucleotide sequence, which is complementary to a target sequence in a target region to be detected in the analyte polynucleotide. This complementary region is of sufficient length and complementarily to the target sequence to afford a duplex of stability which is sufficient to immobilize the analyte polynucleotide to a solid surface (via the binding partners). The binding partner is specific for a second binding partner; the second binding partner can be bound to the surface of a solid support, or may be linked indirectly via other structures or binding partners to a solid support.

The term "targeting polynucleotide sequence" as used herein refers to a polynucleotide sequence which is comprised of nucleotides which are complementary to a target nucleotide sequence; the sequence is of sufficient length and complementarily with the target sequence to form a duplex which has sufficient stability for the purpose intended.

The term "binding partner" as used herein refers to a molecule capable of binding a ligand molecule with high specificity, as for example an antigen and an antibody specific therefor. In general, the specific binding partners must bind with sufficient affinity to immobilize the analyte copy/complementary strand duplex (in the case of capture probes) under the isolation conditions. Specific binding partners are known in the art, and include, for example, biotin and avidin or streptavidin, IgG and protein A, the numerous known receptor-ligand couples, and complementary polynucleotide strands. In the case of complementary polynucleotide binding partners, the partners are normally at least about 15 bases in length, and may be at least 40 bases in length; in addition, they have a content of Gs and Cs of at least about 40% and as much as about 60%. The polynucleotides may be composed of DNA, RNA, or synthetic nucleotide analogs.

The term "coupled" as used herein refers to attachment by covalent bonds or by strong non-covalent interactions (e.g., hydrophobic interactions, hydrogen bonds, etc.). Covalent bonds may be, for example, ester, ether, phosphoester, amide, peptide, imide, carbon-sulfur bonds, carbon-phosphorus bonds, and the like.

The term "support" refers to any solid or semi-solid surface to which a desired binding partner may be anchored. Suitable supports include glass, plastic, metal, polymer gels, and the like, and may take the form of beads, wells, dipsticks, membranes, and the like.

The term "label" as used herein refers to any atom or moiety which can be used to provide a detectable (preferably quantifiable) signal, and which can be attached to a polynucleotide or polypeptide.

As used herein, the term "label probe" refers to a polynucleotide probe which is comprised of a targeting polynucleotide sequence which is complementary to a target sequence to be detected in the analyte polynucleotide. This complementary region is of sufficient length and complementarily to the target sequence to afford a duplex comprised of the "label probe" and the "target sequence" to be detected by the label. The label probe is coupled to a label either directly, or indirectly via a set of ligand molecules with high specificity for each other, including multimers.

The term "multimer," as used herein, refers to linear or branched polymers of the same repeating single-stranded polynucleotide unit or different single-stranded polynucleotide units. At least one of the units has a sequence, length, and composition that permits it to hybridize specifically to a first single-stranded nucleotide sequence of interest, typically an analyte or a polynucleotide probe (e.g., a label probe) bound to an analyte. In order to achieve such specificity and stability, this unit will normally be at least about 15 nucleotides in length, typically no more than about 50 nucleotides in length, and preferably about 30 nucleotides in length; moreover, the content of Gs and Cs will normally be at least about 40%, and at most about 60%. In addition to such unit(s), the multimer includes a multiplicity of units that are capable of hybridizing specifically and stably to a second single-stranded nucleotide of interest, typically a labeled polynucleotide or another multimer. These units are generally about the same size and composition as the multimers discussed above. When a multimer is designed to be hybridized to another multimer, the first and second oligonucleotide units are heterogeneous (different), and do not hybridize with each other under the conditions of the selected assay. Thus, multimers may be label probes, or may be ligands which couple the label to the probe.

A "replicon" is any genetic element, e.g., a plasmid, a chromosome, a virus, a cosmid, etc. that behaves as an autonomous unit of polynucleotide replication within a cell; i.e., capable of replication under its own control. This may include selectable markers.

"PCR" refers to the technique of polymerase chain reaction as described in Saiki, et al., Nature 324: 163 (1986); and Scharf et al., Science (1986) 233: 1076-1078; and U.S. Pat. No. 4,683,195; and U.S. Pat. No. 4,683,202.

As used herein, x is "heterologous" with respect to y if x is not naturally associated with y in the identical manner; i.e., x is not associated with y in nature or x is not associated with y in the same manner as is found in nature.

"Homology" refers to the degree of similarity between x and y. The correspondence between the sequence from one form to another can be determined by techniques known in the art. For example, they can be determined by a direct comparison of the sequence information of the polynucleotide. Alternatively, homology can be determined by hybridization of the polynucleotides under conditions which form stable duplexes between homologous regions (for example, those which would be used prior to S₁ digestion), followed by digestion with single-stranded specific nuclease(s), followed by size determination of the digested fragments.

A "vector" is a replicon in which another polynucleotide segment is attached, so as to bring about the replication and/or expression of the attached segment.

"Control sequence" refers to polynucleotide sequences which are necessary to effect the expression of coding sequences to which they are ligated. The nature of such control sequences differs depending upon the host organism; in prokaryotes, such control sequences generally include promoter, ribosomal binding site, and transcription termination sequence; in eukaryotes, generally, such control sequences include promoters and transcription termination sequence. The term "control sequences" is intended to include, at a minimum, all components whose presence is necessary for expression, and may also include additional components whose presence is advantageous, for example, leader sequences and fusion partner sequences.

"Operably linked" refers to a juxtaposition wherein the components so described are in a relationship permitting them to function in their intended manner. A control sequence "operably linked" to a coding sequence is ligated in such a way that expression of the coding sequence is achieved under conditions compatible with the control sequences.

An "open reading frame" (ORF) is a region of a polynucleotide sequence which encodes a polypeptide; this region may represent a portion of a coding sequence or a total coding sequence.

A "coding sequence" is a polynucleotide sequence which is translated into a polypeptide, usually via mRNA, when placed under the control of appropriate regulatory sequences. The boundaries of the coding sequence are determined by a translation start codon at the 5'-terminus and a translation stop codon at the 3'-terminus. A coding sequence can include, but is not limited to, cDNA, and recombinant polynucleotide sequences.

As used herein, the term "polypeptide" refers to a polymer of amino acids and does not refer to a specific length of the product; thus, peptides, oligopeptides, and proteins are included within the definition of polypeptide. This term also does not refer to or exclude post expression modifications of the polypeptide, for example, glycosylations, acetylations, phosphorylations and the like. Included within the definition are, for example, polypeptides containing one or more analogs of an amino acid (including, for example, unnatural amino acids, etc.), polypeptides with substituted linkages, as well as other modifications known in the art, both naturally occurring and non-naturally occurring.

A polypeptide or amino acid sequence "derived from" a designated nucleic acid sequence refers to a polypeptide having an amino acid sequence identical to that of a polypeptide encoded in the sequence, or a portion thereof wherein the portion consists of at least 3-5 amino acids, and more preferably at least 8-10 amino acids, and even more preferably at least 11-15 amino acids, or which is immunologically identifiable with a polypeptide encoded in the sequence. This terminology also includes a polypeptide expressed from a designated nucleic acid sequence.

"Immunogenic" refers to the ability of a polypeptide to cause a humoral and/or cellular immune response, whether alone or when linked to a carrier, in the presence or absence of an adjuvant. "Neutralization" refers to an immune response that blocks the infectivity, either partially or fully, of an infectious agent.

"Epitope" refers to an antigenic determinant of a peptide, polypeptide, or protein; an epitope can comprise 3 or more amino acids in a spatial conformation unique to the epitope. Generally, an epitope consists of at least 5 such amino acids and, more usually, consists of at least 8-10 such amino acids. Methods of determining spatial conformation of amino acids are known in the art and include, for example, x-ray crystallography and 2-dimensional nuclear magnetic resonance. Antibodies that recognize the same epitope can be identified in a simple immunoassay showing the ability of one antibody to block the binding of another antibody to a target antigen.

"Treatment," as used herein, refers to prophylaxis and/or therapy (i.e., the modulation of any disease symptoms). An "individual" indicates an animal that is susceptible to infection by H. pylori and includes, but is not limited to, primates, including humans. A "vaccine" is an immunogenic, or otherwise capable of eliciting protection against H. pylori, whether partial or complete, composition useful for treatment of an individual. As such, it may include but is not limited to a modified bacterium (e.g., chemical or genetic alterations); bacterial protein subunits, whether recombinantly produced or purified from cell lysates; bacterial genetic material employed in polynucleotide vaccines, etc.

H. pylori proteins may be used for producing antibodies, either monoclonal or polyclonal, specific to the proteins. The methods for producing these antibodies are known in the art.

"Recombinant host cells", "host cells," "cells," "cell cultures," and other such terms denote, for example, microorganisms, insect cells, and mammalian cells, that can be, or have been, used as recipients for recombinant vector or other transfer DNA, and include the progeny of the original cell which has been transformed. It is understood that the progeny of a single parental cell may not necessarily be completely identical in morphology or in genomic or total DNA complement as the original parent, due to natural, accidental, or deliberate mutation. Examples for mammalian host cells include Chinese hamster ovary (CHO) and monkey kidney (COS) cells.

Specifically, as used herein, "cell line," refers to a population of cells capable of continuous or prolonged growth and division in vitro. Often, cell lines are clonal populations derived from a single progenitor cell. It is further known in the art that spontaneous or induced changes can occur in karyotype during storage or transfer of such clonal populations. Therefore, cells derived from the cell line referred to may not be precisely identical to the ancestral cells or cultures, and the cell line referred to includes such variants. The term "cell lines" also includes immortalized cells. Preferably, cell lines include nonhybrid cell lines or hybridomas to only two cell types.

As used herein, the term "microorganism" includes prokaryotic and eukaryotic microbial species such as bacteria and fungi, the latter including yeast and filamentous fungi.

"Transformation", as used herein, refers to the insertion of an exogenous polynucleotide into a host cell, irrespective of the method used for the insertion, for example, direct uptake, transduction, f-mating or electroporation. The exogenous polynucleotide may be maintained as a non-integrated vector, for example, a plasmid, or alternatively, may be integrated into the host genome.

By "purified" and "isolated" is meant, when referring to a polypeptide or nucleotide sequence, that the indicated molecule is present in the substantial absence of other biological macromolecules of the same type. The term "purified" as used herein preferably means at least 75% by weight, more preferably at least 85% by weight, more preferably still at least 95% by weight, and most preferably at least 98% by weight, of biological macromolecules of the same type present (but water, buffers, and other small molecules, especially molecules having a molecular weight of less than 1000, can be present).

C. Nucleic Acid Assays

Using as a basis the genome of H. pylori, and more particularly, the genetic region of CagI, polynucleotide probes of approximately 8 nucleotides or more can be prepared which hybridize with the positive strand(s) of the RNA or its complement, as well as to cDNAs. These polynucleotides serve as probes for the detection, isolation and/or labeling of polynucleotides which contain nucleotide sequences, and/or as primers for the transcription and/or replication of the targeted sequences. Each probe contains a targeting polynucleotide sequence, which is comprised of nucleotides which are complementary to a target nucleotide sequence; the sequence is of sufficient length and complementarily with the sequence to form a duplex which has sufficient stability for the purpose intended. For example, if the purpose is the isolation, via immobilization, of an analyte containing a target sequence, the probes will contain a polynucleotide region which is of sufficient length and complementarily to the targeted sequence to afford sufficient duplex stability to immobilize the analyte on a solid surface under the isolation conditions. For example, also, if the polynucleotide probes are to serve as primers for the transcription and/or replication of target sequences, the probes will contain a polynucleotide region of sufficient length and complementarily to the targeted sequence to allow for replication. For example, also, if the polynucleotide probes are to be used as label probes, or are to bind to multimers, the targeting polynucleotide region would be of sufficient length and complementarily to form stable hybrid duplex structures with the label probes and/or multimers to allow detection of the duplex. The probes may contain a minimum of about 4 contiguous nucleotides which are complementary to the targeted sequence; usually the oligomers will contain a minimum of about 8 continuous nucleotides which are complementary to the targeted sequence, and preferably will contain a minimum of about 14 contiguous nucleotides which are complementary to the targeted sequence.

The probes, however, need not consist only of the sequence which is complementary to the targeted sequence. They may contain additional nucleotide sequences or other moieties. For example, if the probes are to be used as primers for the amplification of sequences via PCR, they may contain sequences which, when in duplex, form restriction enzyme sites which facilitate the cloning of the amplified sequences. For example, also, if the probes are to be used as "capture probes" in hybridization assays, they will be coupled to a "binding partner" as defined above. Preparation of the probes is by means known in the art, including, for example, by methods which include excision, transcription or chemical synthesis.

D. Expression Systems

Once the appropriate H. pylori coding sequence is isolated, it can be expressed in a variety of different expression systems; for example those used with mammalian cells, baculoviruses, bacteria, and yeast.

i. Mammalian Systems

Mammalian expression systems are known in the art. A mammalian promoter is any DNA sequence capable of binding mammalian RNA polymerase and initiating the downstream (3') transcription of a coding sequence (e.g. structural gene) into mRNA. A promoter will have a transcription initiating region, which is usually placed proximal to the 5' end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) upstream of the transcription initiation site. The TATA box is thought to direct RNA polymerase II to begin RNA synthesis at the correct site. A mammalian promoter will also contain an upstream promoter element, usually located within 100 to 200 bp upstream of the TATA box. An upstream promoter element determines the rate at which transcription is initiated and can act in either orientation, Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed (1989).

Mammalian viral genes are often highly expressed and have a broad host range; therefore sequences encoding mammalian viral genes provide particularly useful promoter sequences. Examples include the SV40 early promoter, mouse mammary tumor virus LTR promoter, adenovirus major late promoter (Ad MLP), and herpes simplex virus promoter. In addition, sequences derived from non-viral genes, such as the murine metallotheionein gene, also provide useful promoter sequences. Expression may be either constitutive or regulated (inducible), depending on the promoter can be induced with glucocorticoid in hormone-responsive cells.

The presence of an enhancer element (enhancer), combined with the promoter elements described above, will usually increase expression levels. An enhancer is a regulatory DNA sequence that can stimulate transcription up to 1000-fold when linked to homologous or heterologous promoters, with synthesis beginning at the normal RNA start site. Enhancers are also active when they are placed upstream or downstream from the transcription initiation site, in either normal or flipped orientation, or at a distance of more than 1000 nucleotides from the promoter, Maniatis et al., Science 236: 1237 (1989); Alberts et al. Molecular Biology of the Cell, 2nd ed (1989). Enhancer elements derived from viruses may be particularly useful, because they usually have a broader host range. Examples include the SV40 early gene enhancer, Dijkema et al (1985) EMBO J. 4: 761, and the enhancer/promoters derived from the long terminal repeat (LTR) of the Rous Sarcoma Virus, Gorman et al. (1982) Proc. Natl. Acad. Sci. 79: 6777, and from human cytomegalovirus, Boshart et al. (1985) Cell 41: 5221. Additionally, some enhancers are regulatable and become active only in the presence of an inducer, such as a hormone or metal ion, Sassone-Corsi et al. (1986) Trends Genet. 2: 215; Maniatis et al. (1987) Science 236: 1237.

A DNA molecule may be expressed intracellularly in mammalian cells. A promoter sequence may be directly linked with the DNA molecule, in which case the first amino acid at the N-terminus of the recombinant protein will always be a methionine, which is encoded by the ATG start codon. If desired, the N-terminus may be cleaved from the protein by in vitro incubation with cyanogen bromide.

Alternatively, foreign proteins can also be secreted from the cell into the growth media by creating chimeric DNA molecules that encode a fusion protein comprised of a leader sequence fragment that provides for secretion of the foreign protein in mammalian cells. Preferably, there are processing sites encoded between the leader fragment and the foreign gene that can be cleaved eiLher in vivo or in vitro. The leader sequence fragment usually encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of the protein from the cell. The adenovirus tripartite leader is an example of a leader sequence that provides for secretion of a foreign protein in mammalian cells.

Usually, transcription termination and polyadenylation sequences recognized by mammalian cells are regulatory regions located 3' to the translation stop codon and thus, together with the promoter elements, flank the coding sequence. The 3' terminus of the mature mRNA is formed by site-specific post-transcriptional cleavage and polyadenylation, Birnstiel et al. (1985) Cell 41: 349; Proudfoot and Whitelaw (1988) "Termination and 3' end processing of eukaryotic RNA. In Transcription and splicing (ed. B. D. Hames and D. M. Glover); Proudfoot (1989) Trends Biochem. Sci. 14: 105. These sequences direct the transcription of an mRNA which can be translated into the polypeptide encoded by the DNA. Examples of transcription terminator/polyadenylation signals include those derived from SV40, Sambrook et al (1989), Molecular Cloning: A Laboratory Manual.

Some genes may be expressed more efficiently when introns (also called intervening sequences) are present. Several cDNAs, however, have been efficiently expressed from vectors that lack splicing signals (also called splice donor and acceptor sites), see e.g., Gething and Sambrook (1981) Nature 293: 620. Introns are intervening noncoding sequences within a coding sequence that contain splice donor and acceptor sites. They are removed by a process called "splicing," following polyadenylation of the primary transcript, Nevins (1983) Annu. Rev. Biochem. 52: 441; Green (1986) Annu. Rev. Genet. 20: 671; Padgett et al. (1986) Annu. Rev. Biochem. 55: 1119; Krainer and Maniatis (1988) "RNA splicing," In Transcription and splicing (ed. B. D. Hames and D. M. Glover).

Usually, the above-described components, comprising a promoter, polyadenylation signal, and transcription termination sequence are put together into expression constructs. Enhancers, introns with functional splice donor and acceptor sites, and leader sequences may also be included in an expression construct, if desired. Expression constructs are often maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as mammalian cells or bacteria. Mammalian replication systems include those derived from animal viruses, which require trans-acting factors to replicate. For example, plasmids containing the replication systems of papovaviruses, such as SV40, Gluzman (1981) Cell 23: 175, or polyomavirus, replicate to extremely high copy number in the presence of the appropriate viral T antigen. Additional examples of mammalian replicons include those derived from bovine papillomavirus and Epstein-Barr virus. Additionally, the replicon may have two replication systems, thus allowing it to be maintained, for example, in mammalian cells for expression and in a procaryotic host for cloning and amplification. Examples of such mammalian-bacteria shuttle vectors include pMT2, Kaufman et al. (1989) Mol. Cell. Biol. 9: 946, and pHEBO, Shimizu et al. (1986) Mol. Cell. Biol. 6: 1074.

The transformation procedure used depends upon the host to be transformed. Methods for introduction of heterologous polynucleotides into mammalian cells are known in the art and include dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei.

Mammalian cell lines available as hosts for expression are known in the art and include many immortalized cell lines available from the American Type Culture Collection (ATCC), including but not limited to, Chinese hamster ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular carcinoma cells (e.g., Hep G2), and a number of other cell lines.

ii. Baculovirus Systems

The polynucleotide encoding the protein can also be inserted into a suitable insect expression vector, and is operably linked to the control elements within that vector. Vector construction employs techniques which are known in the art.

Generally, the components of the expression system include a transfer vector, usually a bacterial plasmid, which contains both a fragment of the baculovirus genome, and a convenient restriction site for insertion of the heterologous gene or genes to be expressed; a wild type baculovirus with a sequence homologous to the baculovirus-specific fragment in the transfer vector (this allows for the homologous recombination of the heterologous gene in to the baculovirus genome); and appropriate insect host cells and growth media.

After inserting the DNA sequence encoding the protein into the transfer vector, the vector and the wild type viral genome are transfected into an insect host cell where the vector and viral genome are allowed to recombine. The packaged recombinant virus is expressed and recombinant plaques are identified and purified. Materials and methods for baculovirus/insect cell expression systems are commercially available in kit form from, inter alia, Invitrogen, San Diego Calif. ("MaxBac" kit). These techniques are generally known to those skilled in the art and fully described in Summers and Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987) (hereinafter "Summers and Smith").

Prior to inserting the DNA sequence encoding the protein into the baculovirus genome, the above-described components, comprising a promoter, leader (if desired), coding sequence of interest, and transcription termination sequence, are usually assembled into an intermediate transplacement construct (transfer vector). This construct may contain a single gene and operably linked regulatory elements; multiple genes, each with its owned set of operably linked regulatory elements; or multiple genes, regulated by the same set of regulatory elements. Intermediate transplacement constructs are often maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as a bacterium. The replicon will have a replication system, thus allowing it to be maintained in a suitable host for cloning and amplification.

Currently, the most commonly used transfer vector for introducing foreign genes into AcNPV is pAc373. Many other vectors, known to those of skill in the art, have also been designed. These include, for example, pVL985 (which alters the polyhedrin start codon from ATG to ATT, and which introduces a BamHI cloning site 32 basepairs downstream from the ATT; see Luckow and Summers, Virology (1989) 17: 31.

The plasmid usually also contains the polyhedron polyadenylation signal (Miller et al. (1988) Ann. Rev. Microbiol., 42: 177) and a procaryotic ampicillin-resistance (amp) gene and origin of replication for selection and propagation in E. coli.

Baculovirus transfer vectors usually contain a baculovirus, promoter. A baculovirus promoter is any DNA sequence capable of binding a baculovirus RNA polymerase and initiating the downstream (5' to 3') transcription of a coding sequence (e.g. structural gene) into mRNA. A promoter will have a transcription initiation region which is usually placed proximal to the 5' end of the coding sequence. This transcription initiation region usually includes an RNA polymerase binding site and a transcription initiation site. A baculovirus transfer vector may also have a second domain called an enhancer, which, if present, is usually distal to the structural gene. Expression may be either regulated or constitutive.

Structural genes, abundantly transcribed at late times in a viral infection cycle, provide particularly useful promoter sequences. Examples include sequences derived from the gene encoding the viral polyhedron protein, Friesen et al., (1986) "The Regulation of Baculovirus Gene Expression," in: The Molecular Biology of Baculoviruses (ed. Walter Doerfler); EPO Publ. Nos. 127 839 and 155 476; and the gene encoding the p10 protein, Vlak et al., (1988), J. Gen. Virol. 69: 765.

DNA encoding suitable signal sequences can be derived from genes for secreted insect or baculovirus proteins, such as the baculovirus polyhedrin gene (Carbonell et al. (1988) Gene, 73: 409). Alternatively, since the signals for mammalian cell posttranslational modifications (such as signal peptide cleavage, proteolytic cleavage, and phosphorylation) appear to be recognized by insect cells, and the signals required for secretion and nuclear accumulation also appear to be conserved between the invertebrate cells and vertebrate cells, leaders of non-insect origin, such as those derived from genes encoding human α-interferon, Maeda et al., (1985), Nature 315: 592; human gastrin-releasing peptide, Lebacq-Verheyden et al., (1988), Molec. Cell. Biol. 8: 3129; human IL-2, Smith et al., (1985) Proc. Nat'l Acad. Sci. USA, 82: 8404; mouse IL-3, (Miyajima et al., (1987) Gene 58: 273; and human glucocerebrosidase, Martin et al. (1988) DNA 7: 99, can also be used to provide for secretion in insects.

A recombinant polypeptide or polyprotein may be expressed intracellularly or, if it is expressed with the proper regulatory sequences, it can be secreted. Good intracellular expression of nonfused foreign proteins usually requires heterologous genes that ideally have a short leader sequence containing suitable translation initiation signals preceding an ATG start signal. If desired, methionine at the N-terminus may be cleaved from the mature protein by in vitro incubation with cyanogen bromide.

Alternatively, recombinant polyproteins or proteins which are not naturally secreted can be secreted from the insect cell by creating chimeric DNA molecules that encode a fusion protein comprised of a leader sequence fragment that provides for secretion of the foreign protein in insects. The leader sequence fragment usually encodes a signal peptide comprised of hydrophobic amino acids which direct the translocation of the protein into the endoplasmic reticulum.

After insertion of the DNA sequence and/or the gene encoding the expression product precursor of the protein, an insect cell host is co-transformed with the heterologous DNA of the transfer vector and the genomic DNA of wild type baculovirus--usually by co-transfection. The promoter and transcription termination sequence of the construct will usually comprise a 2-5kb section of the baculovirus genome. Methods for introducing heterologous DNA into the desired site in the baculovirus virus are known in the art. (See Summers and Smith; Ju et al. (1987); Smith et al., Mol. Cell. Biol. (1983) 3: 2156; and Luckow and Summers (1989)). For example, the insertion can be into a gene such as the polyhedrin gene, by homologous double crossover recombination; insertion can also be into a restriction enzyme site engineered into the desired baculovirus gene. Miller et al., (1989), Bioessays 4: 91.

The DNA sequence, when cloned in place of the polyhedrin gene in the expression vector, is flanked both 5' and 3' by polyhedrin-specific sequences and is positioned downstream of the polyhedrin promoter.

The newly formed baculovirus expression vector is subsequently packaged into an infectious recombinant baculovirus. Homologous recombination occurs at low frequency (between about 1% and about 5%); thus, the majority of the virus produced after cotransfection is still wild-type virus. Therefore, a method is necessary to identify recombinant viruses. An advantage of the expression system is a visual screen allowing recombinant viruses to be distinguished. The polyhedrin protein, which is produced by the native virus, is produced at very high levels in the nuclei of infected cells at late times after viral infection. Accumulated polyhedrin protein forms occlusion bodies that also contain embedded particles. These occlusion bodies, up to 15 μm in size, are highly refractile, giving them a bright shiny appearance that is readily visualized under the light microscope. Cells infected with recombinant viruses lack occlusion bodies. To distinguish recombinant virus from wild-type virus, the transfection supernatant is plaqued onto a monolayer of insect cells by techniques known to those skill.ed in the art. Namely, the plaques are screened under the light microscope for the presence (indicative of wild-type virus) or absence (indicative of recombinant virus) of occlusion bodies. "Current Protocols in Microbiology" Vol. 2 (Ausubel et al. eds) at 16.8 (Supp. 10, 1990); Summers and Smith; Miller et al. (1989).

Recombinant baculovirus expression vectors have been developed for infection into several insect cells. For example, recombinant baculoviruses have been developed for, inter alia: Aedes aegypti, Autographa californica, Bombvx mori, Drosophila melanogaster, Spodoptera frugiperda, and Trichoplusia ni (PCT Pub. No. WO 89/046699; Carbonell et al., (1985) J. Virol. 56: 153; Wright (1986) Nature 321: 718; Smith et al., (1983) Mol. Cell. Biol. 3: 2156; and see generally, Fraser, et al. (1989) In Vitro Cell. Dev. Biol. 25: 225).

Cells and cell culture media are commercially available for both direct and fusion expression of heterologous polypeptides in a baculovirus/expression system; cell culture technology is generally known to those skilled in the art. See, e.g., Summers and Smith.

The modified insect cells may then be grown in an appropriate nutrient medium, which allows for stable maintenance of the plasmid(s) present in the modified insect host. Where the expression product gene is under inducible control, the host may be grown to high density, and expression induced. Alternatively, where expression is constitutive, the product will be continuously expressed into the medium and the nutrient medium must be continuously circulated, while removing the product of interest and augmenting depleted nutrients. The product may be purified by such techniques as chromatography, e.g., HPLC, affinity chromatography, ion exchange chromatography, etc.; electrophoresis; density gradient centrifugation; solvent extraction, or the like. As appropriate, the product may be further purified, as required, so as to remove substantially any insect proteins which are also secreted in the medium or result from lysis of insect cells, so as to provide a product which is at least substantially free of host debris, e.g., proteins, lipids and polysaccharides.

In order to obtain protein expression, recombinant host cells derived from the transformants are incubated under conditions which allow expression of the recombinant protein encoding sequence. These conditions will vary, dependent upon the host cell selected. However, the conditions are readily ascertainable to those of ordinary skill in the art, based upon what is known in the art.

iii. Bacterial Systems

Bacterial expression techniques are known in the art. A bacterial promoter is any DNA sequence capable of binding bacterial RNA polymerase and initiating the downstream (3") transcription of a coding sequence (e.g. structural gene) into mRNA. A promoter will have a transcription initiation region which is usually placed proximal to the 5' end of the coding sequence. This transcription initiation region usually includes an RNA polymerase binding site and a transcription initiation site. A bacterial promoter may also have a second domain called an operator, that may overlap an adjacent RNA polymerase binding site at which RNA synthesis begins. The operator permits negative regulated (inducible) transcription, as a gene repressor protein may bind the operator and thereby inhibit transcription of a specific gene. Constitutive expression may occur in the absence of negative regulatory elements, such as the operator. In addition, positive regulation may be achieved by a gene activator protein binding sequence, which, if present is usually proximal (5') to the RNA polymerase binding sequence. An example of a gene activator protein is the catabolite activator protein (CAP), which helps initiate transcription of the lac operon in E. coli, Raibaud et al. (1984) Annu. Rev. Genet. 18: 173. Regulated expression may therefore be either positive or negative, thereby either enhancing or reducing transcription.

Sequences encoding metabolic pathway enzymes provide particularly useful promoter sequences. Examples include promoter sequences derived from sugar metabolizing enzymes, such as galactose, lactose (lac), Chang et al. (1977) Nature 198: 1056, and maltose. Additional examples include promoter sequences derived from biosynthetic enzymes such as tryptophan (trp), Goeddel et al. (1980) Nuc. Acids Res. 8: 4057; Yelverton et al. (1981) Nucl. Acids Res. 9: 731; U.S. Pat. No. 4,738,921; EPO Publ. Nos. 036 776 and 121 775. The beta-glacotamase (bla) promoter system, Weissmann (1981) "The cloning of interferon and other mistakes." In Interferon 3 (ed. I. Gresser), bacteriophage lambda PL, Shimatake et al. (1981) Nature 292: 128, and T5, U.S. Pat. No. 4,689,406, promoter systems also provide useful promoter sequences.

In addition, synthetic promoters which do not occur in nature also function as bacterial promoters. For example, transcription activation sequences of one bacterial or bacteriophage promoter may be joined with the operon sequences of another bacterial or bacteriophage promoter, creating a synthetic hybrid promoter, U.S. Pat. No. 4,551,433. For example, the tac promoter is a hybrid trp-lac promoter comprised of both trp promoter and lac operon sequences that is regulated by the lac repressor, Amann et al. (1983) Gene 25: 167; de Boer et al. (1983) Proc. Natl. Acad. Sci. 80: 21. Furthermore, a bacterial promoter can include naturally occurring promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and initiate transcription. A naturally occurring promoter of non-bacterial origin can also be coupled with a compatible RNA polymerase to produce high levels of expression of some genes in prokaryotes. The bacteriophage T7 RNA polymerase/promoter system is an example of a coupled promoter system, Studier et al. (1986) J. Mol. Biol. 189: 113; Tabor et al. (1985) Proc Natl. Acad. Sci. 82: 1074. In addition, a hybrid promoter can also be comprised of a bacteriophage promoter and an E. coli operator region (EPO Publ. No. 267 851).

In addition to a functioning promoter sequence, an efficient ribosome binding site is also useful for the expression of foreign genes in prokaryotes. In E. coli, the ribosome binding site is called the Shine-Dalgarno (SD) sequence and includes an initiation codon (ATG) and a sequence 3-9 nucleotides in length located 3-11 nucleotides upstream of the initiation codon, Shine et al. (1975) Nature 254: 34. The SD sequence is thought to promote binding of mRNA to the ribosome by the pairing of bases between the SD sequence and the 3' and of E. coli 16S rRNA, Steitz et al. (1979) "Genetic signals and nucleotide sequences in messenger RNA." In Biological Regulation and Development: Gene Expression (ed. R. F. Goldberger). To express eukaryotic genes and prokaryotic genes with weak ribosome-binding site, Sambrook et al. (1989), Molecular Cloning: A Laboratorv Manual.

A DNA molecule may be expressed intracellularly. A promoter sequence may be directly linked with the DNA molecule, in which case the first amino acid at the N-terminus will always be a methionine, which is encoded by the ATG start codon. If desired, methionine at the N-terminus may be cleaved from the protein by in vitro incubation with cyanogen bromide or by either in vivo on in vitro incubation with a bacterial methionine N-terminal peptidase (EPO Publ. No. 219 237).

Fusion proteins provide an alternative to direct expression. Usually, a DNA sequence encoding the N-terminal portion of an endogenous bacterial protein, or other stable protein, is fused to the 5' end of heterologous coding sequences. Upon expression, this construct will provide a fusion of the two amino acid sequences. For example, the bacteriophage lambda cell gene can be linked at the 5' terminus of a foreign gene and expressed in bacteria. The resulting fusion protein preferably retains a site for a processing enzyme (factor Xa) to cleave the bacteriophage protein from the foreign gene, Nagai et al. (1984) Nature 309: 810. Fusion proteins can also be made with sequences from the lacZ, Jia et al. (1987) Gene 60: 197, trpE, Allen et al. (1987) J. Biotechnol. 5: 93; Makoff et al. (1989) J. Gen. Microbiol. 135: 11, and EPO Publ. No. 324 647, genes. The DNA sequence at the junction of the two amino acid sequences may or may not encode a cleavable site. Another example is a ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin region that preferably retains a site for a processing enzyme (e.g. ubiquitin specific processing-protease) to cleave the ubiquitin from the foreign protein. Through this method, native foreign protein can be isolated. Miller et al. (1989) Bio/Technology 7: 698.

Alternatively, foreign proteins can also be secreted from the cell by creating chimeric DNA molecules that encode a fusion protein comprised of a signal peptide sequence fragment that provides for secretion of the foreign protein in bacteria, U.S. Pat. No. 4,336,336. The signal sequence fragment usually encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of the protein from the cell. The protein is either secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located between the inner and outer membrane of the cell (gram-negative bacteria). Preferably there are processing sites, which can be cleaved either in vivo or in vitro encoded between the signal peptide fragment and the foreign gene.

DNA encoding suitable signal sequences can be derived from genes for secreted bacterial proteins, such as the E. coli outer membrane protein gene (ompA). Masui et al. (1983), in: Experimental Manipulation of Gene Expression; Ghrayeb et al. (1984) EMBO J. 3: 2437 and the E. coli alkaline phosphatase signal sequence (phoA), Oka et al. (1985) Proc. Natl. Acad. Sci. 82: 7212. As an additional example, the signal sequence of the alpha-amylase gene from various Bacillus strains can be used to secrete heterologous proteins from B. subtilis. Palva et al. (1982) Proc. Natl. Acad. Sci. USA 79: 5582; EPO Publ. No. 244 042.

Usually, transcription termination sequences recognized by bacteria are regulatory regions located 3' to the translation stop codon, and thus together with the promoter flank the coding sequence. These sequences direct the transcription of an mRNA which can be translated into the polypeptide encoded by the DNA. Transcription termination sequences frequently include DNA sequences of about 50 nucleotides capable of forming stem loop structures that aid in terminating transcription. Examples include transcription termination sequences derived from genes with strong promoters, such as the trp gene in E. coli as well as other biosynthetic genes.

Usually, the above-described components, comprising a promoter, signal sequence (if desired), coding sequence of interest, and transcription termination sequence, are put together into expression constructs. Expression constructs are often maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as bacteria. The replicon will have a replication system, thus allowing it to be maintained in a procaryotic host either for expression or for cloning and amplification. In addition, a replicon may be either a high or low copy number plasmid. A high copy number plasmid will generally have a copy number ranging from about 5 to about 200, and usually about 10 to about 150. A host containing a high copy number plasmid will preferably contain at least about 10, and more preferably at least about 20 plasmids. Either a high or low copy number vector may be selected, depending upon the effect of the vector and the foreign protein on the host.

Alternatively, the expression constructs can be integrated into the bacterial genome with an integrating vector. Integrating vectors usually contain at least one sequence homologous to the bacterial chromosome that allows the vector to integrate. Integrations appear to result from recombinations between homologous DNA in the vector and the bacterial chromosome. For example, integrating vectors constructed with DNA from various Bacillus strains integrate into the Bacillus chromosome (EPO Publ. No. 127 328). Integrating vectors may also be comprised of bacteriophage or transpose sequences.

Usually, extrachromosomal and integrating expression constructs may contain selectable markers to allow for the selection of bacterial strains that have been transformed. Selectable markers can be expressed in the bacterial host and may include genes which render bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin (neomycin), and tetracycline. Davies et al. (1978) Annu. Rev. Microbiol. 32: 469. Selectable markers may also include biosynthetic genes, such as those in the histidine, tryptophan, and leucine biosynthetic pathways.

Alternatively, some of the above-described components can be put together in transformation vectors. Transformation vectors are usually comprised of a selectable marker that is either maintained in a replicon or developed into an integrating vector.

Expression and transformation vectors, either extra-chromosomal replicons or integrating vectors, have been developed for transformation into many bacteria. For example, expression vectors have been developed for, inter alia, the following bacteria: Bacillus subtilis, Palv et al. (1982) Proc. Natl. Acad. Sci. USA 79: 5582; EPO Publ. Nos. 036 259 and 063 953; PCT Publ. No. WO 84/04541; E. coli , Shimatake et al. (1981) Nature 292: 128; Amann et al. (1985) Gene 40: 183; Studier et al. (1986) J. Mol. Biol. 189: 113; EPO Publ. Nos. 036 776, 136 829 and 136 907; Streptococcus cremoris, Powell et al. (1988) Appl. Environ. Microbiol. 54: 655; Streptococcus lividans, Powell et al. (1988) Appl. Environ. Microbiol. 54: 655; and Streptomyces lividans, U.S. Pat. No. 4,745,056.

Methods of introducing exogenous DNA into bacterial hosts are well-known in the art, and usually include either the transformation of bacteria treated with CaCl₂ or other agents, such as divalent cations and DMSO. DNA can also be introduced into bacterial cells by electroporation. Transformation procedures usually vary with the bacterial species to be transformed. See, e.g., Masson et al. (1989) FEMS Microbiol. Lett. 60: 273; Palva et al. (1982) Proc. Natl. Acad. Sci. USA 79: 5582; EPO Publ. Nos. 036 259 and 063 953; PCT Publ. No. WO 84/04541, for Bacillus; Miller et al. (1988) Proc. Natl. Acad. Sci. 85: 856; Wang et al. (1990) J. Bacteriol. 172: 949, for Campylobacter; Cohen et al. (1973) Proc. Natl. Acad. Sci. 69: 2110; Dower et al. (1988) Nucleic Acids Res. 16:, 6127; Kushner (1978) "An improved method for transformation of E. coli with ColE1-derived plasmids," In Genetic Engineering: Proceedings of the International Symposium on Genetic Engineering (eds. H. W. Boyer and S. Nicosia); Mandel et al. (1970) J. Mol. Biol. 53: 159; Taketo (1988) Biochim. Biophys. Acta 949: 318, for Escherichia; Chassy et al. (1987) FEMS Microbiol. Lett. 44: 173, for Lactobacillus; Fiedler et al. (1988) Anal. Biochem 170: 38, for Pseudomonas; Augustin et al. (1990) FEMS Microbiol. Lett. 66: 203, for Staphylococcus; Barany et al. (1980) J. Bacteriol. 144: 698; Harlander (1987) "Transformation of Streptococcus lactis by electroporation, in: Streptococcal Genetics (ed. J. Ferretti and R. Curtiss III); Perry et al. (1981) Infec. Immun. 32: 1295; Powell et al. (1988) Appl. Environ. Microbiol. 54: 655; Somkuti et al. (1987) Proc. 4th Evr. Cong. Biotechnology 1: 412, for Streptococcus.

iv. Yeast Expression

Yeast expression systems are also known to one of ordinary skill in the art. A yeast promoter is any DNA sequence capable of binding yeast RNA polymerase and initiating the downstream (3') transcription of a coding sequence (e.g. structural gene) into mRNA. A promoter will have a transcription initiation region which is usually placed proximal to the 5' end of the coding sequence. This transcription initiation region usually includes an RNA polymerase binding site (the "TATA Box") and a transcription initiation site. A yeast promoter may also have a second domain called an upstream activator sequence (UAS), which, if present, is usually distal to the structural gene. The UAS permits regulated (inducible) expression. Constitutive expression occurs in the absence of a UAS. Regulated expression may be either positive or negative, thereby either enhancing or reducing transcription.

Yeast is a fermenting organism with an active metabolic pathway, therefore sequences encoding enzymes in the metabolic pathway provide particularly useful promoter sequences. Examples include alcohol dehydrogenase (ADH) (EPO Publ. No. 284 044), enolase, glucokinase, glucose-6-phosphate isomerase, glyceraldehyde-3-phosphate-dehydrogenase (GAP or GAPDH), hexokinase, phosphofructokinase, 3-phosphoglycerate mutase, and pyruvate kinase (PyK) (EPO Publ. No. 329 203). The yeast PHO5 gene, encoding acid phosphatase, also provides useful promoter sequences, Myanohara et al. (1983) Proc. Natl. Acad. Sci. USA 80: 1.

In addition, synthetic promoters which do not occur in nature also function as yeast promoters. For example, UAS sequences of one yeast promoter may be joined with the transcription activation region of another yeast promoter, creating a synthetic hybrid promoter. Examples of such hybrid promoters include the ADH regulatory sequence linked to the GAP transcription activation region (U.S. Pat. No. 4,876,197 and U.S. Pat. No. 4,880,734). Other examples of hybrid promoters include promoters which consist of the regulatory sequences of either the ADH2, GAL4, GAL10, or PHO5 genes, combined with the transcriptional activation region of a glycolytic enzyme gene such as GAP or PyK (EPO Publ. No. 164 556). Furthermore, a yeast promoter can include naturally occurring promoters of non-yeast origin that have the ability to bind yeast RNA polymerase and initiate transcription. Examples of such promoters include, inter alia, Cohen et al. (1980) Proc. Natl. Acad. Sci. USA 77: 1078; Henikoff et al. (1981) Nature 283: 835; Hollenberg et al. (1981) Curr. Topics Microbiol. Immunol. 96: 119; Hollenberg et al. (1979) "The Expression of Bacterial Antibiotic Resistance Genes in the Yeast Saccharomyces cerevisiae," in: Plasmids of Medical, Environmental and Commercial Importance (eds. K. N. Timmis and A. Puhler); Mercerau-Puigalon et al. (1980) Gene 11: 163; Panthier et al. (1980) Curr. Genet. 2: 109.

A DNA molecule may be expressed intracellularly in yeast. A promoter sequence may be directly linked with the DNA molecule, in which case the first amino acid at the N-terminus of the recombinant protein will always be a methionine, which is encoded by the ATG start codon. If desired, methionine at the N-terminus may be cleaved from the protein by in vitro incubation with cyanogen bromide.

Fusion proteins provide an alternative for yeast expression systems, as well as in mammalian, baculovirus, and bacterial expression systems. Usually, a DNA sequence encoding the N-terminal portion of an endogenous yeast protein, or other stable protein, is fused to the 5' end of heterologous coding sequences. Upon expression, this construct will provide a fusion of the two amino acid sequences. For example, the yeast or human superoxide dismutase (SOD) gene, can be linked at the 5' terminus of a foreign gene and expressed in yeast. The DNA sequence at the junction of the two amino acid sequences may or may not encode a cleavable site. See e.g., EPO Publ. No. 196 056. Another example is a ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin region that preferably retains a site for a processing enzyme (e.g. ubiquitin-specific processing protease) to cleave the ubiquitin from the foreign protein. Through this method, therefore, native foreign protein can be isolated (see, e.g., PCT Publ. No. WO 88/024066).

Alternatively, foreign proteins can also be secreted from the cell into the growth media by creating chimeric DNA molecules that encode a fusion protein comprised of a leader sequence fragment that provide for secretion in yeast of the foreign protein. Preferably, there are processing sites encoded between the leader fragment and the foreign gene that can be cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of the protein from the cell.

DNA encoding suitable signal sequences can be derived from genes for secreted yeast proteins, such as the yeast invertase gene (EPO Publ. No. 012 873; JPO Publ. No. 62,096,086) and the A-factor gene (U.S. Pat. No. 4,588,684). Alternatively, leaders of non-yeast origin, such as an interferon leader, exist that also provide for secretion in yeast (EPO Publ. No. 060 057).

A preferred class of secretion leaders are those that employ a fragment of the yeast alpha-factor gene, which contains both a "pre" signal sequence, and a "pro" region. The types of alpha-factor fragments that can be employed include the full-length pre-pro alpha factor leader (about 83 amino acid residues) as well as truncated alpha-factor leaders (usually about 25 to about 50 amino acid residues) (U.S. Pat. No. 4,546,083 and U.S. Pat. No. 4,870,008; EPO Publ. No. 324 274). Additional leaders employing an alpha-factor leader fragment that provides for secretion include hybrid alpha-factor leaders made with a presequence of a first yeast, but a pro-region from a second yeast alphafactor. (See e.g., PCT Publ. No. WO 89/02463.)

Usually, transcription termination sequences recognized by yeast are regulatory regions located 3' to the translation stop codon, and thus together with the promoter flank the coding sequence. These sequences direct the transcription of an mRNA which can be translated into the polypeptide encoded by the DNA. Examples of transcription terminator sequence and other yeast-recognized termination sequences, such as those coding for glycolytic enzymes.

Usually, the above-described components, comprising a promoter, leader (if desired), coding sequence of interest, and transcription termination sequence, are put together into expression constructs. Expression constructs are often maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as yeast or bacteria. The replicon may have two replication systems, thus allowing it to be maintained, for example, in yeast for expression and in a procaryotic host for cloning and amplification. Examples of such yeast-bacteria shuttle vectors include YEp24, Botstein et al. (1979) Gene 8: 17-24; pCl/1, Brake et al. (1984) Proc. Natl. Acad. Sci USA 81: 4642-4646; and YRp17, Stinchcomb et al. (1982) J. Mol. Biol. 158: 157. In addition, a replicon may be either a high or low copy number plasmid. A high copy number plasmid will generally have a copy number ranging from about 5 to about 200, and usually about 10 to about 150. A host containing a high copy number plasmid will preferably have at least about 10, and more preferably at least about 20. A high or low copy number vector may be selected, depending upon the effect of the vector and the foreign protein on the host.

Alternatively, the expression constructs can be integrated into the yeast genome with an integrating vector. Integrating vectors usually contain at least one sequence homologous to a yeast chromosome that allows the vector to integrate, and preferably contain two homologous sequences flanking the expression construct. Integrations appear to result from recombinations between homologous DNA in the vector and the yeast chromosome, Orr-Weaver et al. (1983) Methods in Enzymol. 101: 228-245. An integrating vector may be directed to a specific locus in yeast by selecting the appropriate homologous sequence for inclusion in the vector. One or more expression construct may integrate, possibly affecting levels of recombinant protein produced, Rine et al. (1983) Proc. Natl. Acad. Sci. USA 80: 6750. The chromosomal sequences included in the vector can occur either as a single segment in the vector, which results in the integration of the entire vector, or two segments homologous to adjacent segments in the chromosome and flanking the expression construct in the vector, which can result in the stable integration of only the expression construct.

Usually, extrachromosomal and integrating expression constructs may contain selectable markers to allow for the selection of yeast strains that have been transformed. Selectable markers may include biosynthetic genes that can be expressed in the yeast host, such as ADE2, HIS4, LEU2, TRP1, and ALG7, and the G418 resistance gene, which confer resistance in yeast cells to tunicamycin and G418, respectively. In addition, a suitable selectable marker may also provide yeast with the ability to grow in the presence of toxic compounds, such as metal. For example, the presence of CUP1 allows yeast to grow in the presence of copper ions. Butt et al. (1987) Microbiol, Rev. 51: 351.

Alternatively, some of the above-described components can be put together into transformation vectors. Transformation vectors are usually comprised of a selectable marker that is either maintained in a replicon or developed into an integrating vector.

Expression and transformation vectors, either extrachromosomal replicons or integrating vectors, have been developed for transformation into many yeasts. For example, expression vectors have been developed for, inter alia, the following yeasts: Candida albicans, Kurtz, et al. (1986) Mol. Cell. Biol. 6: 142; Candida maltosa, Kunze, et al. (1985) J. Basic Microbiol. 25: 141; Hansenula polymorpha, Gleeson, et al. (1986) J. Gen. Microbiol. 132: 3459; Roggenkamp et al. (1986) Mol. Gen. Genet. 202: 302; Kluyveromyces fragilis, Das, et al. (1984) J. Bacteriol. 158: 1165; Kluyveromyces lactis, De Louvencourt et al. (1983) J. Bacteriol. 154: 737; Van den Berg et al. (1990) Bio/Technology 8: 135; Pichia guillerimondii, Kunze et al. (1985) J. Basic Microbiol. 25: 141; Pichia pastoris, Cregg, et al. (1985) Mol. Cell. Biol. 5: 3376; U.S. Pat. No. 4,837,148 and U.S. Pat. No. 4,929,555; Saccharomyces cerevisiae, Hinnen et al. (1978) Proc. Natl. Acad. Sci. USA 75: 1929; Ito et al. (1983) J. Bacteriol. 153: 163; Schizosaccharomyces pombe, Beach et al. (1981) Nature 300: 706; and Yarrowia lipolytica, Davidow, et al. (1985) Curr. Genet. 10: 38047l Gaillardin, et al. (1985) Curr. Genet. 10: 49.

Methods of introducing exogenous DNA into yeast hosts are well-known in the art, and usually include either the transformation of spheroplasts or of intact yeast cells treated with alkali cations. Transformation procedures usually vary with the yeast species to be transformed. See e.g., Kurtz et al. (1986) Mol. Cell. Biol. 6: 142; Kunze et al. (1985) J. Basic Microbiol. 25: 141, for Candida; Gleeson et al. (1986) J. Gen. Microbioy. 132: 3459; Roggenkamp et al. (1986) Mol. Gen. Genet. 202: 302, for Hansenula; Das et al. (1984) J. Bacteriol. 158: 1165; De Louvencourt et al. (1983) J. Bacteriol. 154: 1165; Van den Berg et al. (1990) Bio/Technology 8: 135, for Kluyveromyces; Cregg et al. (1985) Mol. Cell. Biol. 5: 3376; Kunze et al. (1985) J. Basic Microbiol. 25: 141; U.S. Pat. No. 4,837,148 and U.S. Pat. No. 4,929,555, for Pichia; Hinnen et al. (1978) Proc. Natl. Acad. Sci. USA 75; 1929; Ito et al. (1983) J. Bacteriol. 153: 163, for Saccharomyces; Beach et al. (1981) Nature 300: 706, for Schizosaccharomvces; Davidow et al. (1985) Curr. Genet. 10: 39; Gaillardin et al. (1985) Curr. Genet. 10: 49, for Yarrowia.

E. Vaccines

Proteins encoded by the CagI region (as well as DNA from the region itself in the form of polynucleotide vaccines) may be used as a sole vaccine candidate or in combination with one or more other antigens, the latter either from H. pylori or other pathogenic sources. Preferred are "cocktail" vaccines comprising, for example, the cytotoxin (CT or VacA) antigen, the CagA protein, and the urease. Additionally, the hsp can be added to one or more of these components. These vaccines may either be prophylactic (to prevent infection) or therapeutic (to treat disease after infection).

Such vaccines comprise H. pylori antigen or antigens, usually in combination with "pharmaceutically acceptable carriers", which include any carrier that does not itself induce the production of antibodies harmful to the individual receiving the composition. Suitable carriers are typically large, slowly metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, polymeric amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or liposomes), and inactive virus particles. Such carriers are well known to those of ordinary skill in the art. Additionally, these carriers may function as immunostimulating agents ("adjuvants"). Furthermore, the antigen may be conjugated to a bacterial toxoid, such as a toxoid from diphtheria, tetanus, cholera, H. pylori, etc. pathogens.

Preferred adjuvants to enhance effectiveness of the composition include, but are not limited to: (1) aluminum salts (alum), such as aluminum hydroxide, aluminum phosphate, aluminum sulfate, etc; (2) oil-in-water emulsion formulations (with or without other specific immunostimulating agents such as muramyl peptides (see below) or bacterial cell wall components), such as for example (a) MF59 (and other submicron oil-in-water emulsions described in PCT Publ. No. WO 90/14837), containing 5% Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally containing various amounts of MTP-PE (see below), although not required) formulated into submicron particles using a microfluidizer such as Model 110Y microfluidizer (Microfluidics, Newton, Mass.), (b) SAF, containing 10% Squalane, 0.4% Tween 80, 5% pluronic-blocked polymer L121, and thr-MDP (see below) either microfluidized into a submicron emulsion or vortexed to generate a larger particle size emulsion, and (c) Ribi™ adjuvant system (RAS), (Ribi Immunochem, Hamilton, Mont.) containing 2% Squalene, 0.2% Tween 80, and one or more bacterial cell wall components from the group consisting of monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS), preferably MPL+CWS (Detox™); (3) saponin adjuvants, such as Stimulon™ (Cambridge Bioscience, Worcester, Mass.) may be used or particles generated therefrom such as ISCOMs (immunostimulating complexes); (4) Complete Freunds Adjuvant (CFA) and Incomplete Freunds Adjuvant (IFA); (5) cytokines, such as interleukins (IL-1, IL-2, etc.), macrophage colony stimulating factor (M-CSF), tumor necrosis factor (TNF), etc; and (6) other substances that act as immunostimulating agents to enhance the effectiveness of the composition. Alum and MF59 are preferred.

As mentioned above, muramyl peptides include, but are not limited to, N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-normuramyl-L-alanyl-D-isoglutamine (nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-huydroxyphosphoryloxy)-ethylamine (MTP-PE), etc.

The immunogenic compositions (e.g., the antigen or polynucleotide, pharmaceutically acceptable carrier, and adjuvant) typically will contain diluents, such as water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, such as wetting or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles.

Typically, the immunogenic compositions are prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to injection may also be prepared. The preparation also may be emulsified or encapsulated in liposomes for enhanced adjuvant effect, as discussed above under pharmaceutically acceptable carriers.

Immunogenic compositions used as vaccines comprise an immunologically effective amount of the antigenic polypeptides or nucleotides, as well as any other of the above-mentioned components, as needed. By "immunologically effective amount", it is meant that the administration of that amount to an individual, either in a single dose or as part of a series, is effective for treatment or prevention. This amount varies depending upon the health and physical condition of the individual to be treated, the taxonomic group of individual to be treated (e.g., nonhuman primate, primate, etc.), the capacity of the individual's immune system to synthesize antibodies, the degree of protection desired, the formulation of the vaccine, the treating doctor's assessment of the medical situation, and other relevant factors. It is expected that the amount will fall in a relatively broad range that can be determined through routine trials.

The immunogenic compositions are conventionally administered parenterally, e.g., by injection, either subcutaneously or intramuscularly. Additional formulations suitable for other modes of administration include oral and pulmonary formulations, suppositories, and transdermal applications. Oral formulations are most preferred for the H. pylori proteins. Dosage treatment may be a single dose schedule or a multiple dose schedule. The vaccine may be administered in conjunction with other immunoregulatory agents.

F. Immunodiagnostic Assays

H. pylori antigens encoded by the CagI regions can be used in immunoassays to detect antibody levels (or conversely H. pylori antibodies can be used to detect antigen levels) and correlation can be made with gastroduodenal disease and with duodenal ulcer in particular. Immunoassays based on well defined, recombinant antigens can be developed to replace the invasive diagnostics methods that are used today. Antibodies to H. pylori proteins within biological samples, including for example, blood or serum samples, can be detected. Design of the immunoassays is subject to a great deal of variation, and a variety of these are known in the art. Protocols for the immunoassay may be based, for example, upon competition, or direct reaction, or sandwich type assays. Protocols may also, for example, use solid supports, or may be by immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide; the labels may be, for example, fluorescent, chemiluminescent, radioactive, or dye molecules. Assays which amplify the signals from the probe are also known; examples of which are assays which utilize biotin and avidin, and enzyme-labeled and mediated immunoassays, such as ELISA assays.

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are constructed by packaging the appropriate materials, including the compositions of the invention, in suitable containers, along with the remaining reagents and materials (for example, suitable buffers, salt solutions, etc.) required for the conduct of the assay, as well as suitable set of assay instructions.

G. Examples

The examples presented below are provided as a further guide to the practitioner of ordinary skill in the art and are not to be construed as limiting the invention in any way.

1. Materials and methods

a. Growth of H. pylori and DNA isolation

H. pylori strains can be cultured in solid or liquid media for 3 days at 37° C., both in microaerophilic atmosphere using Oxoid (Basingstoke, England) or Becton and Dickinson (Cockeysville, Md.) gas pack generators or in an incubator containing air supplemented with 50% CO2, (26). The bacteria can be harvested and resuspended in STE (NaCl 0.1M, Tris-HCl 10 mM pH 8, EDTA 1 mM pH 8) containing lysozyme at a final concentration of 100 micrograms/ml and incubated at room temperature for 5 min. To lyse the bacteria SDS can be added to a final concentration 1% and heated at 65° C. After the addition of proteinase K at final concentration of 25 micrograms/ml, the solution can be incubated at 50° for 2 hours. The DNA can be purified by CsCl gradient in the presence of ethidium bromide, precipitated with 77% ethanol and recovered with a sealed glass capillary. b. Construction and screening of a lambda gt11 expression library

To generate a lambda gt11 expression library, genomic DNA from the CCUG 17874 strain partially digested with restriction enzymes can be used. After fractionation on 0.8% agarose gel, the DNA between 0.6 and 8 Kb in size can be eluted using a Costar Spin-X (0.22 micron) microcentrifuge filter. The products from each digestion can be combined, and used to construct an expression library, using the lambda gt11 cloning system kit (Bethesda Research Laboratories) and the Gigapack II Gold packaging kit (Stratagene, La Jolla, Calif.).

c. Cloning

Using overlapping libraries of H. pylori DNA from strain CCUG 17874 digested with EcoRI and HindIII, cloned into vector Bluscript SK+, contiguous clones covering more than 31,000 bases of the H. pylori chromosome were identified, containing a full-length copy of the CagI region. Using random transpose mutagenesis and ad hoc deletions in different subclones, phenotypic variants were produced by allelic exchange.

As can be seen from FIG. 1, four clones, E64 (14,500 bases), H12 (4000 bases), 5B (7800 bases), and 11.1A (5900 bases), generated from the plasmid library, contain the entire CagI region and these four clones have been deposited with the American Type Culture Collection (ATCC), see below. These clones overlap with the CagA region (see PCT Publ. No. WO 93/18150 for the sequence for CagA). If one looks at the sequences on FIG. 1 designated 1, 2, and 3, the following is noted: Sequence 1 (10299 bases) (SEQ ID NOS 1 and 5) contains the 3'-terminus of CagA, the entire CagA gene, and part of CagI. This sequence starts with nucleotide 1 and terminates at nucleotide 10299 (SEQ ID NOS 1 and 5). Within this region, from nucleotides 4909 to 10299 is the CagA locus. From nucleotide 1 to 4908 is the 3'-terminus of CagI. Sequence 2 (5599 bases) starts with nucleotide 1 and terminates at 5599 (SEQ ID NOS 9 and 13); there is a minor gap of 400 base pairs between nucleotide 5599 of sequence 2 and nucleotide 1 of sequence 1. Sequence 3 (1529 bases) starts with nucleotide 1 and terminates at 1529 (SEQ ID NOS 17 and 21). Again, there is a minor gap of 400 base pairs between nucleotide 1529 of sequence 3 and nucleotide 1 of sequence 2. By assembling sequences 1, 2, and 3, the CagI region is covered from the 5'-terminus to the 3'-terminus. Furthermore, FIG. 3 is the complete nucleotide sequence of the CagI locus (19,932 base pairs) (SEQ ID NO 25), including the true 5'-terminus of CagI, contained in clone E64, which is about 2500 bases upstream from the beginning of sequence 3.

d. Sequencing

The nucleotide sequences of the overlapping clones was determined by manual and automated sequencing using nested deletions and primer walking. The produced data were subjected to computer analysis using the Wisconsin Genetic Group package running on parallel supercomputers. FIGS. 2 and 3 show the nucleotide (SEQ ID NOS 1, 5, 9, 13, 17, and 21 for FIG. 2 and SEQ ID NO 25 for FIG. 3) and predicted amino acid sequences of the CagI region. (SEQ ID NOS 2-4, 6-8, 10-12, 14-16, 18-20, and 22-24 for FIG. 2) e. Structure of the CagI region

The CagI region contains clusters of putative open reading frames (ORFs) with different polarity. FIG. 4 shows the putative open reading frames (ORF) for this region and homologies with known proteins (SEQ ID NOS 26-46). It is hypothesized that some of these ORFs may encode exporter molecules with homology to the pt1 genes of Bordetella pertussis and VIR B4 genes of Agrobacterium tumefaciens and for proteins with motifs shared by purported invasion factors of Salmonella genus. Predicted amino acid sequences for the various frameshifts are shown in FIGS. 2 and 4.

f. Hybridizations

All the clones were tested by Southern blotting on the DNA isolated from 44 well-characterized clinical isolates of H. pylori. The contiguous fragments located in the CagI region were able to recognize homologous sequences in all Type I strains of H. pylori but failed to hybridize with the DNA from Type II strains of H. pylori. It is possible that the approximately 120 base pair DNA segment ("MAK") present at both the 5-' and 3'-terminus of the CagI region may account for evolutionary divergence of Type I and Type II H. pylori bacteria.

H. Deposit of Biological Materials

The following materials were deposited on Apr. 7, 1995, with the American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockville, Md., phone (301) 231-5519, under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for Purposes of Patent Procedure.

ATCC No. 69781 E. coli host strain TG1 containing plasmid 11.1A (CMCC #4411);

ATCC No. 69782 E. coli host strain TG1 containing plasmid 5B (CMCC #4412);

ATCC No. 69783 E. coli host strain TG1 containing plasmid H12 (CMCC #4413); and

ATCC No. 69784 E. coli host strain DH10B containing plasmid E64 (CMCC #4421).

These deposits are provided as convenience to those of skill in the art, and are not an admission that a deposit is required under 35 U.S.C. §112. The nucleic acid sequences of these deposits, as well as the amino acid sequences of the polypeptides encoded thereby, are incorporated herein by reference and should be referred to in the event of any error in the sequences described herein as compared with the sequences of the deposits. A license may be required to make, use, or sell the deposited materials, and no such license is granted hereby.

    __________________________________________________________________________     #             SEQUENCE LISTING     - (1) GENERAL INFORMATION:     -    (iii) NUMBER OF SEQUENCES: 46     - (2) INFORMATION FOR SEQ ID NO:1:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 10299 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     #ID NO:1: (xi) SEQUENCE DESCRIPTION: SEQ     - TTTGTTCAGT GATTTCGCCT TCCATTTCTT CTTCTATGAA GTCCAATTCT TC - #TTTCAGTT       60     - CAAAAAGATA ATTAGAAAAA CTATCCAAAA TCGTCAAGAC ATCATTTTCA AA - #ATTTCCAA      120     - TAATTTTTGT TCACGCAAAT TTTGTTTCAT TTTAATACTC CTCTATTTGT TG - #ATACATTT      180     - GTCTCAAGGC CTGATATTTA TCTATGATAC TATGGTTTTG GATAATCTTA TC - #AATTTCTT      240     - TGACAAATAC AGTATCTGTG GATAAAATTT TCAAATATTC TTTAGGAATG CC - #TCTCAAAT      300     - TAAAACTAGC GATAACGCTA GGGCTTCCAT CCTGTTTGTA GAGGATTTTC CT - #ATCTAGTC      360     - CCTTAGTGAT GATTTCAAAT TCTTTTTCTG TAACATTAGC CAATCTTTGG TA - #ATCAGAAA      420     - GATTGCCCCC ATCGTTTCTC AAAAAAATCT TTGTAGGGCA TTGTTCTCTA AT - #CGTATCAG      480     - CAATAGGGCA AGCCAAAAGA TCAGTGATGC TTTGAGTCGC AAGTCTGACA AT - #AGCGTTTC      540     - TTTTCCTTGC AGTTTTTAGC ATGTCTCTTA CAAAATAAGC GACCTTTGGA TC - #GCTAAATA      600     - TTTCCAGGCT TCATCAATAT CTAAGACAAA TCTACGCCCA TCCATTGCCT CT - #TGGATACG      660     - AGCGAAAAGG TAAAAACAAA TAAAGGGCGA AACATCATTA TTGTCTAAGA AA - #CTTGACCC      720     - ATCAACGCCA ATAATCGTTT TTGAAAAATC TAAGCGATCT GTTGCTTTAT TA - #TCAAAAAG      780     - CCATTGAAAT TCACCATTGG TTGATTTGCA AAAAGGCGCT AATCGCGCGA CA - #AGCCCATT      840     - AGGATCATTG TGGTCTTTCC CGAAAGCATT AATAAGTTGA GTGATGGGAT AA - #TCTAGATT      900     - CATATTTCCT GTGATAAGGT TGGTTACTGC GCTGCAAGCG TATTAGAATC TG - #CTAGGCTA      960     - AAAGAGATGC TGTTGCCATT TTCATCTTTT TCATCGCTTT TAGTTGCTAA GT - #TTTTCACA     1020     - AGCTCTTTGA CAACAGAAAT AGCTGTTTGT TTTTGCTCCA TTGTTGCATT TG - #TTTTTTGC     1080     - ACACAAGCCG CCCAAGCAAA AGGATTTAAT CCTGTATCTG TCCCTAGCTC AA - #TCTTGACA     1140     - TACTCCCCAC CCATTGCGAC AATATTCCCA TAAGCGCCAT AATCTTTATC CA - #TATAAACC     1200     - ATAGTGAGCT TTTGCTTGTC TTTGCTGACA TTAGCAGGAA AATTATAGGC AA - #ATTGTCCC     1260     - ATAGCGTTCA AGGTCATTGA CATAAACACT GTCTTACCTG AACCGGTTGA GC - #CAAGTATC     1320     - AAAGTGTGTC CTGCTGAAGC TGAACCAAAA TCAGTGGGCA TGTGGAAGTT CA - #GATAAAAA     1380     - GGCGAATTGA TCTCGCTTTT TAGCGTCATC ACACTATTGC CCCAAGCGTT AT - #TCTCTTGA     1440     - TTGCCATCAA AACTCATAGC CCTCATAGCG ATGAAATCAG CAAAATTATT AG - #AAGTTACA     1500     - TCAAAAATAA AAGGAAGCGT GATAAAAGAG CAATGTTTGG CAAAAAAGTA AT - #TTTCCATA     1560     - GAGAAAGTCG CTGCGTTGGC TAAAAAACCT TTAGCGTTAA GACTAGAGAC GC - #ATTCCTTA     1620     - ACGCTTTGTT TCATTTTTTC AAAGCTATCA GCAAACAGCA CTAAAGAATT AC - #CATAACTG     1680     - CCTAGCGTAA TATCACCATT ACCCACTAAT TCGCTCAAGC AACCTAAAGT CA - #TGCCCTGT     1740     - TCTTTAGAGC CTCCACTAAT AATAATTCTT CTAGAGGTGA AAGCCAGTTT GT - #CCTTTAAA     1800     - ACCTGTGAGT TTTTAGGCGA ATAAGCATGC ATGAAAATAA ATTCGCTGTC TA - #GGGCGTTG     1860     - ATTTTATCAA ACAAATCGCT TTGTGATTTA GGGGCGTATT CACTAATCTC AA - #TAGCGCTA     1920     - AAATATTTTT CACTCAAATC GTCATTTAAG ATTTTTCCAT GCTTATTGGC AA - #AATAAACT     1980     - TCTTTCACCC CACCATGCAT TTTTTCCTTG AGATACAAGT CTTTTCGGTT GC - #AAATAAAA     2040     - GGGGCTTCAT TCATTCCCAA AAGAAAATTG TAAAATTCGC ATTGTTTGGA GT - #AAATAACG     2100     - CCATCTTTAG TGTATTCTTT TAATCTAGTG GGGTGGTATT TGCTCAACAG CT - #CTTCTATG     2160     - AGCTCTATCC TATCCTTGAA GTTTTCAAGC TTGGCTCTAA TAATCCTTTG AA - #ACTCTTCA     2220     - AAATTATTGT CTGCAAAATG CTTTTTATTC ATAACGGGTT CATTGAGAGT GT - #CTAATAAA     2280     - TCTTGCTCTA TGGTCAGAAA AAAACTAATA TCATAAAAAC TTTCTCTCTT TT - #GCTTCTCA     2340     - TTATAGGCTC GCATGAAATC ATTAGAAAAA ATAAGACCAT AGTCCCTATT GG - #TTTCATCA     2400     - ATAACGATTT TCTTTTTAAT AGTGTGAAAA TAGAATTTGA ATTCAGGGGT AA - #CAAAATTC     2460     - CTAAAAACGC TATAAATAGA AGCGTGTAAC TCTATGAGAT CTTTTTTGGA AG - #TGGTTAAA     2520     - AAATCAATGC CCCCCAATTT GATTGTGCCT AAAAGAGAAT AGTTGTTAGT AA - #GGATCACC     2580     - CCATCATCTA AAAAACATTC ATAGTTATTT GCTAGATAGG AGTTTGCAGC GC - #TCACAAGT     2640     - CTGTCTTCTC TGTTTGGATT TAAGTGGATG TCATTAGCCA TTTCTTTACT AG - #GCTTCATG     2700     - GAAAAAATGC TCATGAACGC TTTGTTTTTC ACGCCCTTAA ACAAAAAAGG TT - #TTTTAAAT     2760     - TTCATCGCTC GCTCCATTCT TTGATAAAGC CTATAATCTT TCTTGAATCC AA - #GAGCTACA     2820     - AGCACAATAA CAATCGCTAC AATCAAAACA GGTTCATAGG CTTGAAAAAG AA - #TAACAGAT     2880     - AATACAATGG TTACAAACAA TATAAATATA GAGGAATAAA TAAAAGTTTC AG - #GGAAACCA     2940     - AACAACCTAT TCCCCCCATC AAACAAGACT TTAAAAAAGG GATTGACACC CT - #TTTGCATG     3000     - TCTGCTTTAA GTTCTTCTAT TTTTTGAAAC TGCCGCTTTT GAACCTCTTG CT - #CTATAATT     3060     - AGCTTTTTTT GTTCATCAGC CTGCTTGCTT GCCACAAACA CCTCTCTCTT TA - #TAGATATA     3120     - CCGCTTCACA TGTAATCGTA TAAAAGATTT TTTTGAGAGA CTCTACGGTG CT - #AATATGTT     3180     - TCAAAAGATC ATTAGGATCA TAAGAATTGA ATACGGCCAA TAAAACATTA TA - #TAACTTAT     3240     - CATCGCATAG AATTTCTCTT GTTTCTCCGC GCAATGACAG AAAGCAGCGT TG - #TTTGTTGG     3300     - TCGTGCTGAT GCTTTTGAAA GTAAAAAAGT CTTTCACTTC AGGATTGATC TG - #TAATTCTA     3360     - CATTCAATCC CATTTCCTTA CCCTTTTCAT CAAAGATTTT TTCAATAACT GG - #ATCGTAAT     3420     - GCTTCAAATC CTTTATTTTT TTAAGGACTC TATTGACAAT CACGAAGTCA AA - #AACTTCAT     3480     - CTTTGATAAT ATCGGGATTG ACTTCTTTGA AAGTTACTTT CTTGTCTTTC AA - #ATTTTTGA     3540     - TAGTCGCTTT GAAACTATCA AAATCTAAAT TTGTATAAAC AAGCCCATTG GG - #AGTGTTTT     3600     - TTTCTTTTTC TTGTGCTTCT TTTTTGGCTT CTTTGTCATC ATTTGCTAAC CC - #ATACGAAC     3660     - TGAAAACAAC GAGACTTAAG AGAACTTTCA AAAAAAAGCC TCTTAGTTTC TT - #ATTGCTAT     3720     - TATTATTATT GTTGATCAAC TTAGCTAGCT CCTCCACCCT CGCCAATATT GA - #AGCCAAAC     3780     - TTAGTGCTCA AATAGATAAT ACCGCCTGCC ACCGCTAACA TAGCTATGGG TT - #GCGCGTAA     3840     - GCAAAAACAG TCGCCTGACC TCTTTTAATG TCATCAGAGA TTTTCCAAAT AT - #CCGCTATG     3900     - CCTTTGACCC CTAAAGCGCA ACCACCTACG ATCGCTAGAA CAGAAATGAT CT - #GAATAACC     3960     - AAACCTTTAG TTGCAGTGAC GCCTTCTGTA GGACTGGCGA CCGCCATTAA AG - #GATTGGTT     4020     - GTTACCACTA GCCCTAAAGT TACTACAACT TTCTTGTAGC TGTCAGTGAT TC - #TTGTAAAA     4080     - AATTTCATGC GTTTCCTTTC AAATTGAAAT CAATCGCTTG AGTATATCAA AA - #AAAAAAGT     4140     - ATTTTTATAC TATTCATACA AGCGCTACTT TATAATTTAA ATCAAAACCG AC - #GCTTTTGC     4200     - TCGGCAACTG ACATCATTCA GGAATAGTAA ACCTACTTGT CCCAACCATT TT - #TCTTTCTC     4260     - AAGTCGTTGT AGAATTGTAG ATCTTTAGGA TCTTTGATGT ATTTTTTAAT CG - #TCTCAGGT     4320     - TGAAACCTAA AAACAAGCAA AAACAAACCC AAGCTGATCA GAGTGAGAAT AA - #AGCTCCAT     4380     - TTTAAGCAAC TCCATAGACC ACTAAAGAAA CTTTTTTTGA GGCTATCTTT GA - #AAATCTGT     4440     - CCTATTGATT TGTTTTCCAT TTTGTTTCCC ATGTGGATCT TGTGGATCAC AA - #ACGCTTAA     4500     - TTATACATGC TATAGTAAGC ATGACACACA AACCAAACTA TTTTTAGAAC GC - #TTCATGTG     4560     - CTCACCTTGA CTAACCATTT CTCCAACCAT ACTTTAGCGT TGCATTTGAT TT - #CTTCAAAA     4620     - AGATTCATTT CTTATTTCTT GTTCTTATTA AAGTTCTTTC ATTTTAGCAA AT - #TTTTGTTA     4680     - ATTGTGGGTA AAAATGTGAA TCGTCCTAGC CTTTAGACGC CTGCAACGAT CG - #GGCTTTTT     4740     - TCAATATTAA TAATGATTAA TGAAAAAAAA AAAAAATGCT TGATATTGTT GT - #ATAATGAG     4800     - AATGTTCAAA GACATGAATT GACTACTCAA GCGTGTAGCG ATTTTTAGCA GT - #CTTTGACA     4860     - CTAACAAGAT ACCGATAGGT ATGAAACTAG GTATAGTAAG GAGAAACAAT GA - #CTAACGAA     4920     - ACCATTGACC AACAACCACA AACCGAAGCG GCTTTTAACC CGCAGCAATT TA - #TCAATAAT     4980     - CTTCAAGTAG CTTTTCTTAA AGTTGATAAC GCTGTCGCTT CATACGATCC TG - #ATCAAAAA     5040     - CCAATCGTTG ATAAGAACGA TAGGGATAAC AGGCAAGCTT TTGAAGGAAT CT - #CGCAATTA     5100     - AGGGAAGAAT ACTCCAATAA AGCGATCAAA AATCCTACCA AAAAGAATCA GT - #ATTTTTCA     5160     - GACTTTATCA ATAAGAGCAA TGATTTAATC AACAAAGACA ATCTCATTGA TG - #TAGAATCT     5220     - TCCACAAAGA GCTTTCAGAA ATTTGGGGAT CAGCGTTACC GAATTTTCAC AA - #GTTGGGTG     5280     - TCCCATCAAA ACGATCCGTC TAAAATCAAC ACCCGATCGA TCCGAAATTT TA - #TGGAAAAT     5340     - ATCATACAAC CCCCTATCCT TGATGATAAA GAGAAAGCGG AGTTTTTGAA AT - #CTGCCAAA     5400     - CAATCTTTTG CAGGAATCAT TATAGGGAAT CAAATCCGAA CGGATCAAAA GT - #TCATGGGC     5460     - GTGTTTGATG AGTCCTTGAA AGAAAGGCAA GAAGCAGAAA AAAATGGAGA GC - #CTACTGGT     5520     - GGGGATTGGT TGGATATTTT TCTCTCATTT ATATTTGACA AAAAACAATC TT - #CTGATGTC     5580     - AAAGAAGCAA TCAATCAAGA ACCAGTTCCC CATGTCCAAC CAGATATAGC CA - #CTACCACC     5640     - ACCGACATAC AAGGCTTACC GCCTGAAGCT AGAGATTTAC TTGATGAAAG GG - #GTAATTTT     5700     - TCTAAATTCA CTCTTGGCGA TATGGAAATG TTAGATGTTG AGGGAGTCGC TG - #ACATTGAT     5760     - CCCAATTACA AGTTCAATCA ATTATTGATT CACAATAACG CTCTGTCTTC TG - #TGTTAATG     5820     - GGGAGTCATA ATGGCATAGA ACCTGAAAAA GTTTCATTGT TGTATGGGGG CA - #ATGGTGGT     5880     - CCTGGAGCTA GGCATGATTG GAACGCCACC GTTGGTTATA AAGACCAACA AG - #GCAACAAT     5940     - GTGGCTACAA TAATTAATGT GCATATGAAA AACGGCAGTG GCTTAGTCAT AG - #CAGGTGGT     6000     - GAGAAAGGGA TTAACAACCC TAGTTTTTAT CTCTACAAAG AAGACCAACT CA - #CAGGCTCA     6060     - CAACGAGCAT TAAGTCAAGA AGAGATCCAA AACAAAATAG ATTTCATGGA AT - #TTCTTGCA     6120     - CAAAATAATG CTAAATTAGA CAACTTGAGC GAGAAAGAGA AGGAAAAATT CC - #GAACTGAG     6180     - ATTAAAGATT TCCAAAAAGA CTCTAAGGCT TATTTAGACG CCCTAGGGAA TG - #ATCGTATT     6240     - GCTTTTGTTT CTAAAAAAGA CACAAAACAT TCAGCTTTAA TTACTGAGTT TG - #GTAATGGG     6300     - GATTTGAGCT ACACTCTCAA AGATTATGGG AAAAAAGCAG ATAAAGCTTT AG - #ATAGGGAG     6360     - AAAAATGTTA CTCTTCAAGG TAGCCTAAAA CATGATGGCG TGATGTTTGT TG - #ATTATTCT     6420     - AATTTCAAAT ACACCAACGC CTCCAAGAAT CCCAATAAGG GTGTAGGCGT TA - #CGAATGGC     6480     - GTTTCCCATT TAGAAGTAGG CTTTAACAAG GTAGCTATCT TTAATTTGCC TG - #ATTTAAAT     6540     - AATCTCGCTA TCACTAGTTT CGTAAGGCGG AATTTAGAGG ATAAACTAAC CA - #CTAAAGGA     6600     - TTGTCCCCAC AAGAAGCTAA TAAGCTTATC AAAGATTTTT TGAGCAGCAA CA - #AAGAATTG     6660     - GTTGGAAAAA CTTTAAACTT CAATAAAGCT GTAGCTGACG CTAAAAACAC AG - #GCAATTAT     6720     - GATGAAGTGA AAAAAGCTCA GAAAGATCTT GAAAAATCTC TAAGGAAACG AG - #AGCATTTA     6780     - GAGAAAGAAG TAGAGAAAAA ATTGGAGAGC AAAAGCGGCA ACAAAAATAA AA - #TGGAAGCA     6840     - AAAGCTCAAG CTAACAGCCA AAAAGATGAG ATTTTTGCGT TGATCAATAA AG - #AGGCTAAT     6900     - AGAGACGCAA GAGCAATCGC TTACGCTCAG AATCTTAAAG GCATCAAAAG GG - #AATTGTCT     6960     - GATAAACTTG AAAATGTCAA CAAGAATTTG AAAGACTTTG ATAAATCTTT TG - #ATGAATTC     7020     - AAAAATGGCA AAAATAAGGA TTTCAGCAAG GCAGAAGAAA CACTAAAAGC CC - #TTAAAGGT     7080     - TCGGTGAAAG ATTTAGGTAT CAATCCAGAA TGGATTTCAA AAGTTGAAAA CC - #TTAATGCA     7140     - GCTTTGAATG AATTCAAAAA TGGCAAAAAT AAGGATTTCA GCAAGGTAAC GC - #AAGCAAAA     7200     - AGCGACCTTG AAAATTCCGT TAAAGATGTG ATCATCAATC AAAAGGTAAC GG - #ATAAAGTT     7260     - GATAATCTCA ATCAAGCGGT ATCAGTGGCT AAAGCAACGG GTGATTTCAG TA - #GGGTAGAG     7320     - CAAGCGTTAG CCGATCTCAA AAATTTCTCA AAGGAGCAAT TGGCCCAACA AG - #CTCAAAAA     7380     - AATGAAAGTC TCAATGCTAG AAAAAAATCT GAAATATATC AATCCGTTAA GA - #ATGGTGTG     7440     - AATGGAACCC TAGTCGGTAA TGGGTTATCT CAAGCAGAAG CCACAACTCT TT - #CTAAAAAC     7500     - TTTTCGGACA TCAAGAAAGA GTTGAATGCA AAACTTGGAA ATTTCAATAA CA - #ATAACAAT     7560     - AATGGACTCA AAAACGAACC CATTTATGCT AAAGTTAATA AAAAGAAAGC AG - #GGCAAGCA     7620     - GCTAGCCTTG AAGAACCCAT TTACGCTCAA GTTGCTAAAA AGGTAAATGC AA - #AAATTGAC     7680     - CGACTCAATC AAATAGCAAG TGGTTTGGGT GTTGTAGGGC AAGCAGCGGG CT - #TCCCTTTG     7740     - AAAAGGCATG ATAAAGTTGA TGATCTCAGT AAGGTAGGGC TTTCAAGGAA TC - #AAGAATTG     7800     - GCTCAGAAAA TTGACAATCT CAATCAAGCG GTATCAGAAG CTAAAGCAGG TT - #TTTTTGGC     7860     - AATCTAGAGC AAACGATAGA CAAGCTCAAA GATTCTACAA AACACAATCC CA - #TGAATCTA     7920     - TGGGTTGAAA GTGCAAAAAA AGTACCTGCT AGTTTGTCAG CGAAACTAGA CA - #ATTACGCT     7980     - ACTAACAGCC ACATACGCAT TAATAGCAAT ATCAAAAATG GAGCAATCAA TG - #AAAAAGCG     8040     - ACCGGCATGC TAACGCAAAA AAACCCTGAG TGGCTCAAGC TCGTGAATGA TA - #AGATAGTT     8100     - GCGCATAATG TAGGAAGCGT TCCTTTGTCA GAGTATGATA AAATTGGCTT CA - #ACCAGAAG     8160     - AATATGAAAG ATTATTCTGA TTCGTTCAAG TTTTCCACCA AGTTGAACAA TG - #CTGTAAAA     8220     - GACACTAATT CTGGCTTTAC GCAATTTTTA ACCAATGCAT TTTCTACAGC AT - #CTTATTAC     8280     - TGCTTGGCGA GAGAAAATGC GGAGCATGGA ATCAAGAACG TTAATACAAA AG - #GTGGTTTC     8340     - CAAAAATCTT AAAGGATTAA GGAATACCAA AAACGCAAAA ACCACCCCTT GC - #TAAAAGCG     8400     - AGGGGTTTTT TAATACTCCT TAGCAGAAAT CCCAATCGTC TTTAGTATTT GG - #GATGAATG     8460     - CTACCAATTC ATGGTATCAT ATCCCCATAC ATTCGTATCT AGCGTAGGAA GT - #GTGCAAAG     8520     - TTACGCCTTT GGAGATATGA TGTGTGAGAC CTGTAGGGAA TGCGTTGGAG CT - #CAAACTCT     8580     - GTAAAATCCC TATTATAGGG ACACAGAGTG AGAACCAAAC TCTCCCTACG GG - #CAACATCA     8640     - GCCTAGGAAG CCCAATCGTC TTTAGCGGTT GGGCACTTCA CCTTAAAATA TC - #CCGACAGA     8700     - CACTAACGAA AGGCTTTGTT CTTTAAAGTC TGCATGGATA TTTCCTACCC CA - #AAAAGACT     8760     - TAACCCTTTG CTTAAAATTA AGTTTGATTG TGCTAGTGGG TTCGTGCTAT AG - #TGCGAAAA     8820     - TTAATTAAGG GTTATAAAGA GAGCATAAAC TAGAAAAAAC AAGTAGCTAT AA - #CAAAGATC     8880     - AAGTTCAAAA AATCATAGAG CTTTTAGAGC AAATTGATCG CGCTCTTAAC CA - #AAGAAAAA     8940     - TCAGAAAAAC CATAGGAATT ATCACACCTT ATAATGCCCA AAAAAGACGC TT - #GCGATCAG     9000     - AAGTGGAAAA ATACGGCTTC AAGAATTTTG ATGAGCTCAA AATAGACACT GT - #GGATGCCT     9060     - TTCAAGGTGA AGAGGCAGAT ATTATTATTT ATTCCACCGT GAAAACTTGT GG - #TAATCTTT     9120     - CTTTCTTGCT AGATTCTAAA CGCTTGAATG TGGCTATTTC TAGGGCAAAA GA - #AAATCTCA     9180     - TTTTTGTGGG TAAAAAGTCT TTCTTTGAGA ATTTACGAAG CGATGAGAAG AA - #TATCTTTA     9240     - GCGCTATTTT GCAAGTCTGT AGATAGGTAA TCTTTTCCAA AGATAATCAT TA - #GACATTCT     9300     - TCGCTTCAAA ACGCTTTCAT AAATCTCTCT AAAGCGCTTT ATAATCAACA CA - #ATACCCTT     9360     - ATAGTGTGAG CTATAGCCCC TTTTTGGGAA TTGAGTTATT TTGACTTTAA AT - #TTTTATTA     9420     - GCGTTACAAT TTGAGCCATT CTTTAGCTTG TTTTTCTAGC CAGATCACAT CG - #CCGCTCGC     9480     - ATGAAATTCC ACTTTAGGGA ATGCGTGTGC ATTTTTTTTA AGGGCGTATT TT - #TGCTGCAA     9540     - ATATCCTACA ATAGCATCGC CCGAATGGAT GAGTAGGGGG GGTGTTGAAA GG - #GCAAAATG     9600     - CTCCATAAAA TAGCCCTCAA TTTTTTGAGC GATTAAGGGA AAATGCGTGC AA - #CCTAAAAT     9660     - AATCACTTCG GGAAAATCTT TAAGGGAGTG AAATAATAAC GCATGCAAGT TT - #CTAACAAT     9720     - TCGCCCTCTA AAATACTTTC TTCAATCAAA GGCACAAAAA GAGAAGTGGC TA - #AATGCGAA     9780     - ACATTCAAAT AGCCTTGTTG TTTCAGGGCA TTGTCATAAG CGTTGGATTG GA - #TCGTCGCT     9840     - TTTGTCCCTA GCACTAAAAT AGGGGCGTTT TTATCTTTTA CTTGTCGCTT GA - #TCGCTAAA     9900     - ATGCTTGGCT CAATCACGCC CACAATAGGG ATTTTGGAAT GCTTTTGCAT CT - #CTTCTAAA     9960     - GCTAGAGCGC TCGCTGTGTT GCATGCCACA ATCAATAATT CAATCTGGTG CG - #GTTTGAAA     10020     - AAATCCAAAG CCTCTAAGCC AAATTGCTTG ATCGTAGTGG GGTCTTTAGT GC - #CATAAGGC     10080     - ACTCTAGCCG TATCGCCATA ATAGATGATT TCATCAAATA ATTGCGCTTT TA - #AAAGGCTT     10140     - TTTAAAACGC TAAACCCTCC CACACCGCTA TCAAAAACGC CTATTTTCAT GA - #CACTTTTT     10200     - TAATTTAATG GGATTAATTA GGGATTTTAT TTTTCATTCA TTAAGTTTAA AA - #ATTCTTCA     10260     # 10299            GCAT TTTAGAATAG ACAAAGCTT     - (2) INFORMATION FOR SEQ ID NO:2:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 3289 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:2: (xi) SEQUENCE DESCRIPTION: SEQ     -      Phe Val Gln Phe Arg Leu Pro Phe - # Leu Leu Leu Ser Pro Ile Leu     Leu     #   15     -      Ser Val Gln Lys Asp Asn Lys Asn - # Tyr Pro Lys Ser Ser Arg His     His     #                 30     -      Phe Gln Asn Phe Gln Phe Leu Phe - # Thr Gln Ile Leu Phe His Phe     Asn     #             45     -      Thr Pro Leu Phe Val Asp Thr Phe - # Val Ser Arg Pro Asp Ile Tyr     Leu     #         60     -      Tyr Tyr Gly Phe Gly Ser Tyr Gln - # Phe Leu Gln Ile Gln Tyr Leu     Trp     #     80     -      Ile Lys Phe Ser Asn Ile Leu Glu - # Cys Leu Ser Asn Asn Arg Arg     Gly     #   95     -      Phe His Pro Val Cys Arg Gly Phe - # Ser Tyr Leu Val Pro Phe Gln     Ile     #                110     -      Leu Phe Leu His Pro Ile Phe Gly - # Asn Gln Lys Asp Cys Pro His     Arg     #            125     -      Phe Ser Lys Lys Ser Leu Gly Ile - # Val Leu Ser Tyr Gln Gln Gly     Lys     #        140     -      Pro Lys Asp Gln Cys Phe Glu Ser - # Gln Val Gln Arg Phe Phe Ser     Leu     #    160     -      Gln Phe Leu Ala Cys Leu Leu Gln - # Asn Lys Arg Pro Leu Asp Arg     Ile     #   175     -      Phe Pro Gly Phe Ile Asn Ile Asp - # Lys Ser Thr Pro Ile His Cys     Leu     #                190     -      Leu Asp Thr Ser Glu Lys Val Lys - # Thr Asn Lys Gly Arg Asn Ile     Ile     #            205     -      Ile Val Glu Thr Pro Ile Asn Ala - # Asn Asn Arg Phe Lys Ile Ala     Ile     #        220     -      Cys Cys Phe Ile Ile Lys Lys Pro - # Leu Lys Phe Thr Ile Gly Phe     Ala     #    240     -      Lys Arg Arg Ser Arg Asp Lys Pro - # Ile Arg Ile Ile Val Val Phe     Pro     #   255     -      Glu Ser Ile Asn Lys Leu Ser Asp - # Gly Ile Ile Ile His Ile Ser     Cys     #                270     -      Asp Lys Val Gly Tyr Cys Ala Ala - # Ser Val Leu Glu Ser Ala Arg     Leu     #            285     -      Lys Glu Met Leu Leu Pro Phe Ser - # Ser Phe Ser Ser Leu Leu Val     Ala     #        300     -      Lys Phe Phe Thr Ser Ser Leu Thr - # Thr Glu Ile Ala Val Cys Phe     Cys     #    320     -      Ser Ile Val Ala Phe Val Phe Cys - # Thr Gln Ala Ala Gln Ala Lys     Gly     #   335     -      Phe Asn Pro Val Ser Val Pro Ser - # Ser Ile Leu Thr Tyr Ser Pro     Pro     #                350     -      Ile Ala Thr Ile Phe Pro Ala Pro - # Ser Leu Ser Ile Thr Ile Val     Ser     #            365     -      Phe Cys Leu Ser Leu Leu Thr Leu - # Ala Gly Lys Leu Ala Asn Cys     Pro     #        380     -      Ile Ala Phe Lys Val Ile Asp Ile - # Asn Thr Val Leu Pro Glu Pro     Val     #    400     -      Glu Pro Ser Ile Lys Val Cys Pro - # Ala Glu Ala Glu Pro Lys Ser     Val     #   415     -      Gly Met Trp Lys Phe Arg Lys Gly - # Glu Leu Ile Ser Leu Phe Ser     Val     #                430     -      Ile Thr Leu Leu Pro Gln Ala Leu - # Phe Ser Leu Pro Ser Lys Leu     Ile     #            445     -      Ala Leu Ile Ala Met Lys Ser Ala - # Lys Leu Leu Glu Val Thr Ser     Lys     #        460     -      Ile Lys Gly Ser Val Ile Lys Glu - # Gln Cys Leu Ala Lys Lys Phe     Ser     #    480     -      Ile Glu Lys Val Ala Ala Leu Ala - # Lys Lys Pro Leu Ala Leu Arg     Leu     #   495     -      Glu Thr His Ser Leu Thr Leu Cys - # Phe Ile Phe Ser Lys Leu Ser     Ala     #                510     -      Asn Ser Thr Lys Glu Leu Pro Leu - # Pro Ser Val Ile Ser Pro Leu     Pro     #            525     -      Thr Asn Ser Leu Lys Gln Pro Lys - # Val Met Pro Cys Ser Leu Glu     Pro     #        540     -      Pro Leu Ile Ile Ile Leu Leu Glu - # Val Lys Ala Ser Leu Ser Phe     Lys     #    560     -      Thr Cys Glu Phe Leu Gly Glu Ala - # Cys Met Lys Ile Asn Ser Leu     Ser     #   575     -      Arg Ala Leu Ile Leu Ser Asn Lys - # Ser Leu Cys Asp Leu Gly Ala     Tyr     #                590     -      Ser Leu Ile Ser Ile Ala Leu Lys - # Tyr Phe Ser Leu Lys Ser Ser     Phe     #            605     -      Lys Ile Phe Pro Cys Leu Leu Ala - # Lys Thr Ser Phe Thr Pro Pro     Cys     #        620     -      Ile Phe Ser Leu Arg Tyr Lys Ser - # Phe Arg Leu Gln Ile Lys Gly     Ala     #    640     -      Ser Phe Ile Pro Lys Arg Lys Leu - # Asn Ser His Cys Leu Glu Ile     Thr     #   655     -      Pro Ser Leu Val Tyr Ser Phe Asn - # Leu Val Gly Trp Tyr Leu Leu     Asn     #                670     -      Ser Ser Ser Met Ser Ser Ile Leu - # Ser Leu Lys Phe Ser Ser Leu     Ala     #            685     -      Leu Ile Ile Leu Asn Ser Ser Lys - # Leu Leu Ser Ala Lys Cys Phe     Leu     #        700     -      Phe Ile Thr Gly Ser Leu Arg Val - # Ser Asn Lys Ser Cys Ser Met     Val     #    720     -      Arg Lys Lys Leu Ile Ser Lys Leu - # Ser Leu Phe Cys Phe Ser Leu     Ala     #   735     -      Arg Met Lys Ser Leu Glu Lys Ile - # Arg Pro Ser Leu Leu Val Ser     Ser     #                750     -      Ile Thr Ile Phe Phe Leu Ile Val - # Lys Asn Leu Asn Ser Gly Val     Thr     #            765     -      Lys Phe Leu Lys Thr Leu Ile Glu - # Ala Cys Asn Ser Met Arg Ser     Phe     #        780     -      Leu Glu Val Val Lys Lys Ser Met - # Pro Pro Asn Leu Ile Val Pro     Lys     #    800     -      Arg Glu Leu Leu Val Arg Ile Thr - # Pro Ser Ser Lys Lys His Ser     Leu     #   815     -      Phe Ala Arg Glu Phe Ala Ala Leu - # Thr Ser Leu Ser Ser Leu Phe     Gly     #                830     -      Phe Lys Trp Met Ser Leu Ala Ile - # Ser Leu Leu Gly Phe Met Glu     Lys     #            845     -      Met Leu Met Asn Ala Leu Phe Phe - # Thr Pro Leu Asn Lys Lys Gly     Phe     #        860     -      Leu Asn Phe Ile Ala Arg Ser Ile - # Leu Ser Leu Ser Phe Leu Asn     Pro     #    880     -      Arg Ala Thr Ser Thr Ile Thr Ile - # Ala Thr Ile Lys Thr Gly Ser     Ala     #   895     -      Lys Arg Ile Thr Asp Asn Thr Met - # Val Thr Asn Asn Ile Asn Ile     Glu     #                910     -      Glu Ile Lys Val Ser Gly Lys Pro - # Asn Asn Leu Phe Pro Pro Ser     Asn     #            925     -      Lys Thr Leu Lys Lys Gly Leu Thr - # Pro Phe Cys Met Ser Ala Leu     Ser     #        940     -      Ser Ser Ile Phe Asn Cys Arg Phe - # Thr Ser Cys Ser Ile Ile Ser     Phe     #    960     -      Phe Cys Ser Ser Ala Cys Leu Leu - # Ala Thr Asn Thr Ser Leu Phe     Ile     #   975     -      Asp Ile Pro Leu His Met Ser Tyr - # Lys Arg Phe Phe Glu Thr Leu     Arg     #                990     -      Cys Tyr Val Ser Lys Asp His Asp - # His Lys Asn Ile Arg Pro Ile     Lys     #           10050     -      His Tyr Ile Thr Tyr His Arg Ile - # Glu Phe Leu Leu Phe Leu Arg     Ala     #       10205     -      Met Thr Glu Ser Ser Val Val Cys - # Trp Ser Cys Cys Phe Lys Lys     Ser     #  10405     -      Leu Ser Leu Gln Asp Ser Val Ile - # Leu His Ser Ile Pro Phe Pro     Tyr     # 10550     -      Pro Phe His Gln Arg Phe Phe Gln - # Leu Asp Arg Asn Ala Ser Asn     Pro     #               10700 - #                1065     -      Leu Phe Phe Gly Leu Tyr Gln Ser - # Arg Ser Gln Lys Leu His Leu     Tyr     #           10850     -      Arg Asp Leu Leu Lys Leu Leu Ser - # Cys Leu Ser Asn Phe Ser Leu     Asn     #       11005     -      Tyr Gln Asn Leu Asn Leu Tyr Lys - # Gln Ala His Trp Glu Cys Phe     Phe     #  11205     -      Leu Phe Leu Val Leu Leu Phe Trp - # Leu Leu Cys His His Leu Leu     Thr     # 11350     -      His Thr Asn Lys Gln Arg Asp Leu - # Arg Glu Leu Ser Lys Lys Ser     Leu     #               11500 - #                1145     -      Leu Val Ser Tyr Cys Tyr Tyr Tyr - # Tyr Cys Ser Thr Leu Ala Pro     Pro     #           11650     -      Pro Ser Pro Ile Leu Lys Pro Asn - # Leu Val Leu Lys Ile Ile Pro     Pro     #       11805     -      Ala Thr Ala Asn Ile Ala Met Gly - # Cys Ala Ala Lys Thr Val Ala     Pro     #  12005     -      Leu Leu Met Ser Ser Glu Ile Phe - # Gln Ile Ser Ala Met Pro Leu     Thr     # 12150     -      Pro Lys Ala Gln Pro Pro Thr Ile - # Ala Arg Thr Glu Met Ile Ile     Thr     #               12300 - #                1225     -      Lys Pro Leu Val Ala Val Thr Pro - # Ser Val Gly Leu Ala Thr Ala     Ile     #           12450     -      Lys Gly Leu Val Val Thr Thr Ser - # Pro Lys Val Thr Thr Thr Phe     Leu     #       12605     -      Leu Ser Val Ile Leu Val Lys Asn - # Phe Met Arg Phe Leu Ser Asn     Asn     #  12805     -      Gln Ser Leu Glu Tyr Ile Lys Lys - # Lys Ser Ile Phe Ile Leu Phe     Ile     # 12950     -      Gln Ala Leu Leu Tyr Asn Leu Asn - # Gln Asn Arg Arg Phe Cys Ser     Ala     #               13100 - #                1305     -      Thr Asp Ile Ile Gln Glu Thr Tyr - # Leu Ser Gln Pro Phe Phe Phe     Leu     #           13250     -      Lys Ser Leu Asn Cys Arg Ser Leu - # Gly Ser Leu Met Tyr Phe Leu     Ile     #       13405     -      Val Ser Gly Asn Leu Lys Thr Ser - # Lys Asn Lys Pro Lys Leu Ile     Arg     #  13605     -      Val Arg Ile Lys Leu His Phe Lys - # Gln Leu His Arg Pro Leu Lys     Lys     # 13750     -      Leu Phe Leu Arg Leu Ser Leu Lys - # Ile Cys Pro Ile Asp Leu Phe     Ser     #               13900 - #                1385     -      Ile Leu Phe Pro Met Trp Ile Leu - # Trp Ile Thr Asn Ala Leu Tyr     Met     #           14050     -      Leu Ala His Thr Asn Gln Thr Ile - # Phe Arg Thr Leu His Val Leu     Thr     #       14205     -      Leu Thr Asn His Phe Ser Asn His - # Thr Leu Ala Leu His Leu Ile     Ser     #  14405     -      Ser Lys Arg Phe Ile Ser Tyr Phe - # Leu Phe Leu Leu Lys Phe Phe     His     # 14550     -      Phe Ser Lys Phe Leu Leu Ile Val - # Gly Lys Asn Val Asn Arg Pro     Ser     #               14700 - #                1465     -      Leu Thr Pro Ala Thr Ile Gly Leu - # Phe Ser Ile Leu Ile Met Ile     Asn     #           14850     -      Glu Lys Lys Lys Lys Cys Leu Ile - # Leu Leu Tyr Asn Glu Asn Val     Gln     #       15005     -      Arg His Glu Leu Thr Thr Gln Ala - # Cys Ser Asp Phe Gln Ser Leu     Thr     #  15205     -      Leu Thr Arg Tyr Arg Val Asn Val - # Gly Glu Thr Met Thr Asn Glu     Thr     # 15350     -      Ile Asp Gln Gln Pro Gln Thr Glu - # Ala Ala Phe Asn Pro Gln Gln     Phe     #               15500 - #                1545     -      Ile Asn Asn Leu Gln Val Ala Phe - # Leu Lys Val Asp Asn Ala Val     Ala     #           15650     -      Ser Tyr Asp Pro Asp Gln Lys Pro - # Ile Val Asp Lys Asn Asp Arg     Asp     #       15805     -      Asn Arg Gln Ala Phe Glu Gly Ile - # Ser Gln Leu Arg Glu Glu Tyr     Ser     #  16005     -      Asn Lys Ala Ile Lys Asn Pro Thr - # Lys Lys Asn Gln Tyr Phe Ser     Asp     # 16150     -      Phe Ile Asn Lys Ser Asn Asp Leu - # Ile Asn Lys Asp Asn Leu Ile     Asp     #               16300 - #                1625     -      Val Glu Ser Ser Thr Lys Ser Phe - # Gln Lys Phe Gly Asp Gln Arg     Tyr     #           16450     -      Arg Ile Phe Thr Ser Trp Val Ser - # His Gln Asn Asp Pro Ser Lys     Ile     #       16605     -      Asn Thr Arg Ser Ile Arg Asn Phe - # Met Glu Asn Ile Ile Gln Pro     Pro     #  16805     -      Ile Leu Asp Asp Lys Glu Lys Ala - # Glu Phe Leu Lys Ser Ala Lys     Gln     # 16950     -      Ser Phe Ala Gly Ile Ile Ile Gly - # Asn Gln Ile Arg Thr Asp Gln     Lys     #               17100 - #                1705     -      Phe Met Gly Val Phe Asp Glu Ser - # Leu Lys Glu Arg Gln Glu Ala     Glu     #           17250     -      Lys Asn Gly Glu Pro Thr Gly Gly - # Asp Trp Leu Asp Ile Phe Leu     Ser     #       17405     -      Phe Ile Phe Asp Lys Lys Gln Ser - # Ser Asp Val Lys Glu Ala Ile     Asn     #  17605     -      Gln Glu Pro Val Pro His Val Gln - # Pro Asp Ile Ala Thr Thr Thr     Thr     # 17750     -      Asp Ile Gln Gly Leu Pro Pro Glu - # Ala Arg Asp Leu Leu Asp Glu     Arg     #               17900 - #                1785     -      Gly Asn Phe Ser Lys Phe Thr Leu - # Gly Asp Met Glu Met Leu Asp     Val     #           18050     -      Glu Gly Val Ala Asp Ile Asp Pro - # Asn Tyr Lys Phe Asn Gln Leu     Leu     #       18205     -      Ile His Asn Asn Ala Leu Ser Ser - # Val Leu Met Gly Ser His Asn     Gly     #  18405     -      Ile Glu Pro Glu Lys Val Ser Leu - # Leu Tyr Gly Gly Asn Gly Gly     Pro     # 18550     -      Gly Ala Arg His Asp Trp Asn Ala - # Thr Val Gly Tyr Lys Asp Gln     Gln     #               18700 - #                1865     -      Gly Asn Asn Val Ala Thr Ile Ile - # Asn Val His Met Lys Asn Gly     Ser     #           18850     -      Gly Leu Val Ile Ala Gly Gly Glu - # Lys Gly Ile Asn Asn Pro Ser     Phe     #       19005     -      Tyr Leu Tyr Lys Glu Asp Gln Leu - # Thr Gly Ser Gln Arg Ala Leu     Ser     #  19205     -      Gln Glu Glu Ile Gln Asn Lys Ile - # Asp Phe Met Glu Phe Leu Ala     Gln     # 19350     -      Asn Asn Ala Lys Leu Asp Asn Leu - # Ser Glu Lys Glu Lys Glu Lys     Phe     #               19500 - #                1945     -      Arg Thr Glu Ile Lys Asp Phe Gln - # Lys Asp Ser Lys Ala Tyr Leu     Asp     #           19650     -      Ala Leu Gly Asn Asp Arg Ile Ala - # Phe Val Ser Lys Lys Asp Thr     Lys     #       19805     -      His Ser Ala Leu Ile Thr Glu Phe - # Gly Asn Gly Asp Leu Ser Tyr     Thr     #  20005     -      Leu Lys Asp Tyr Gly Lys Lys Ala - # Asp Lys Ala Leu Asp Arg Glu     Lys     # 20150     -      Asn Val Thr Leu Gln Gly Ser Leu - # Lys His Asp Gly Val Met Phe     Val     #               20300 - #                2025     -      Asp Tyr Ser Asn Phe Lys Tyr Thr - # Asn Ala Ser Lys Asn Pro Asn     Lys     #           20450     -      Gly Val Gly Val Thr Asn Gly Val - # Ser His Leu Glu Val Gly Phe     Asn     #       20605     -      Lys Val Ala Ile Phe Asn Leu Pro - # Asp Leu Asn Asn Leu Ala Ile     Thr     #  20805     -      Ser Phe Val Arg Arg Asn Leu Glu - # Asp Lys Leu Thr Thr Lys Gly     Leu     # 20950     -      Ser Pro Gln Glu Ala Asn Lys Leu - # Ile Lys Asp Phe Leu Ser Ser     Asn     #               21100 - #                2105     -      Lys Glu Leu Val Gly Lys Thr Leu - # Asn Phe Asn Lys Ala Val Ala     Asp     #           21250     -      Ala Lys Asn Thr Gly Asn Tyr Asp - # Glu Val Lys Lys Ala Gln Lys     Asp     #       21405     -      Leu Glu Lys Ser Leu Arg Lys Arg - # Glu His Leu Glu Lys Glu Val     Glu     #  21605     -      Lys Lys Leu Glu Ser Lys Ser Gly - # Asn Lys Asn Lys Met Glu Ala     Lys     # 21750     -      Ala Gln Ala Asn Ser Gln Lys Asp - # Glu Ile Phe Ala Leu Ile Asn     Lys     #               21900 - #                2185     -      Glu Ala Asn Arg Asp Ala Arg Ala - # Ile Ala Tyr Ala Gln Asn Leu     Lys     #           22050     -      Gly Ile Lys Arg Glu Leu Ser Asp - # Lys Leu Glu Asn Val Asn Lys     Asn     #       22205     -      Leu Lys Asp Phe Asp Lys Ser Phe - # Asp Glu Phe Lys Asn Gly Lys     Asn     #  22405     -      Lys Asp Phe Ser Lys Ala Glu Glu - # Thr Leu Lys Ala Leu Lys Gly     Ser     # 22550     -      Val Lys Asp Leu Gly Ile Asn Pro - # Glu Trp Ile Ser Lys Val Glu     Asn     #               22700 - #                2265     -      Leu Asn Ala Ala Leu Asn Glu Phe - # Lys Asn Gly Lys Asn Lys Asp     Phe     #           22850     -      Ser Lys Val Thr Gln Ala Lys Ser - # Asp Leu Glu Asn Ser Val Lys     Asp     #       23005     -      Val Ile Ile Asn Gln Lys Val Thr - # Asp Lys Val Asp Asn Leu Asn     Gln     #  23205     -      Ala Val Ser Val Ala Lys Ala Thr - # Gly Asp Phe Ser Arg Val Glu     Gln     # 23350     -      Ala Leu Ala Asp Leu Lys Asn Phe - # Ser Lys Glu Gln Leu Ala Gln     Gln     #               23500 - #                2345     -      Ala Gln Lys Asn Glu Ser Leu Asn - # Ala Arg Lys Lys Ser Glu Ile     Tyr     #           23650     -      Gln Ser Val Lys Asn Gly Val Asn - # Gly Thr Leu Val Gly Asn Gly     Leu     #       23805     -      Ser Gln Ala Glu Ala Thr Thr Leu - # Ser Lys Asn Phe Ser Asp Ile     Lys     #  24005     -      Lys Glu Leu Asn Ala Lys Leu Gly - # Asn Phe Asn Asn Asn Asn Asn     Asn     # 24150     -      Gly Leu Lys Asn Glu Pro Ile Tyr - # Ala Lys Val Asn Lys Lys Lys     Ala     #               24300 - #                2425     -      Gly Gln Ala Ala Ser Leu Glu Glu - # Pro Ile Tyr Ala Gln Val Ala     Lys     #           24450     -      Lys Val Asn Ala Lys Ile Asp Arg - # Leu Asn Gln Ile Ala Ser Gly     Leu     #       24605     -      Gly Val Val Gly Gln Ala Ala Gly - # Phe Pro Leu Lys Arg His Asp     Lys     #  24805     -      Val Asp Asp Leu Ser Lys Val Gly - # Leu Ser Arg Asn Gln Glu Leu     Ala     # 24950     -      Gln Lys Ile Asp Asn Leu Asn Gln - # Ala Val Ser Glu Ala Lys Ala     Gly     #               25100 - #                2505     -      Phe Phe Gly Asn Leu Glu Gln Thr - # Ile Asp Lys Leu Lys Asp Ser     Thr     #           25250     -      Lys His Asn Pro Met Asn Leu Trp - # Val Glu Ser Ala Lys Lys Val     Pro     #       25405     -      Ala Ser Leu Ser Ala Lys Leu Asp - # Asn Tyr Ala Thr Asn Ser His     Ile     #  25605     -      Arg Ile Asn Ser Asn Ile Lys Asn - # Gly Ala Ile Asn Glu Lys Ala     Thr     # 25750     -      Gly Met Leu Thr Gln Lys Asn Pro - # Glu Trp Leu Lys Leu Val Asn     Asp     #               25900 - #                2585     -      Lys Ile Val Ala His Asn Val Gly - # Ser Val Pro Leu Ser Glu Tyr     Asp     #           26050     -      Lys Ile Gly Phe Asn Gln Lys Asn - # Met Lys Asp Tyr Ser Asp Ser     Phe     #       26205     -      Lys Phe Ser Thr Lys Leu Asn Asn - # Ala Val Lys Asp Thr Asn Ser     Gly     #  26405     -      Phe Thr Gln Phe Leu Thr Asn Ala - # Phe Ser Thr Ala Ser Tyr Tyr     Cys     # 26550     -      Leu Ala Arg Glu Asn Ala Glu His - # Gly Ile Lys Asn Val Asn Thr     Lys     #               26700 - #                2665     -      Gly Gly Phe Gln Lys Ser Arg Ile - # Lys Glu Tyr Gln Lys Arg Lys     Asn     #           26850     -      His Pro Leu Leu Lys Ala Arg Gly - # Phe Leu Ile Leu Leu Ser Arg     Asn     #       27005     -      Pro Asn Arg Leu Tyr Leu Gly Met - # Leu Pro Ile His Gly Ile Ile     Ser     #  27205     -      Pro Tyr Ile Arg Ile Arg Arg Lys - # Cys Ala Lys Leu Arg Leu Trp     Arg     # 27350     -      Tyr Asp Val Asp Leu Gly Met Arg - # Trp Ser Ser Asn Ser Val Lys     Ser     #               27500 - #                2745     -      Leu Leu Gly His Arg Val Arg Thr - # Lys Leu Ser Leu Arg Ala Thr     Ser     #           27650     -      Ala Glu Ala Gln Ser Ser Leu Ala - # Val Gly His Phe Thr Leu Lys     Tyr     #       27805     -      Pro Asp Arg His Arg Lys Ala Leu - # Phe Phe Lys Val Cys Met Asp     Ile     #  28005     -      Ser Tyr Pro Lys Lys Thr Pro Phe - # Ala Asn Val Leu Cys Trp Val     Arg     # 28150     -      Ala Ile Val Arg Lys Leu Ile Lys - # Gly Tyr Lys Glu Ser Ile Asn     Lys     #               28300 - #                2825     -      Lys Gln Val Ala Ile Thr Lys Ile - # Lys Phe Lys Lys Ser Ser Phe     Ser     #           28450     -      Lys Leu Ile Ala Leu Leu Thr Lys - # Glu Lys Ser Glu Lys Pro Glu     Leu     #       28605     -      Ser His Leu Ile Met Pro Lys Lys - # Asp Ala Cys Asp Gln Lys Trp     Lys     #  28805     -      Asn Thr Ala Ser Arg Ile Leu Met - # Ser Ser Lys Thr Leu Trp Met     Pro     # 28950     -      Phe Lys Val Lys Arg Gln Ile Leu - # Leu Phe Ile Pro Pro Lys Leu     Val     #               29100 - #                2905     -      Val Ile Phe Leu Ser Cys Ile Leu - # Asn Ala Met Trp Leu Phe Leu     Gly     #           29250     -      Gln Lys Lys Ile Ser Phe Leu Trp - # Val Lys Ser Leu Ser Leu Arg     Ile     #       29405     -      Tyr Glu Ala Met Arg Arg Ile Ser - # Leu Ala Leu Phe Cys Lys Ser     Val     #  29605     -      Asp Arg Ser Phe Pro Lys Ile Ile - # Ile Arg His Ser Ser Leu Gln     Asn     # 29750     -      Ala Phe Ile Asn Leu Ser Lys Ala - # Leu Tyr Asn Gln His Asn Thr     Leu     #               29900 - #                2985     -      Ile Val Ala Ile Ala Pro Phe Trp - # Glu Leu Ser Tyr Phe Asp Phe     Lys     #           30050     -      Phe Leu Leu Ala Leu Gln Phe Glu - # Pro Phe Phe Ser Leu Phe Phe     Pro     #       30205     -      Asp His Ile Ala Ala Arg Met Lys - # Phe His Phe Arg Glu Cys Val     Cys     #  30405     -      Ile Phe Phe Lys Gly Val Phe Leu - # Leu Gln Ile Ser Tyr Asn Ser     Ile     # 30550     -      Ala Arg Met Asp Glu Gly Gly Cys - # Lys Gly Lys Met Leu His Lys     Ile     #               30700 - #                3065     -      Ala Leu Asn Phe Leu Ser Asp Gly - # Lys Met Arg Ala Thr Asn Asn     His     #           30850     -      Phe Gly Lys Ile Phe Lys Gly Val - # Lys Arg Met Gln Val Ser Asn     Asn     #       31005     -      Ser Pro Ser Lys Ile Leu Ser Ser - # Ile Lys Gly Thr Lys Arg Glu     Val     #  31205     -      Ala Lys Cys Glu Thr Phe Lys Pro - # Cys Cys Phe Arg Ala Leu Ser     Ala     # 31350     -      Leu Asp Trp Ile Val Ala Phe Val - # Pro Ser Thr Lys Ile Gly Ala     Phe     #               31500 - #                3145     -      Leu Ser Phe Thr Cys Arg Leu Ile - # Ala Lys Met Leu Gly Ser Ile     Thr     #           31650     -      Pro Thr Ile Gly Ile Leu Glu Cys - # Phe Cys Ile Ser Ser Lys Ala     Arg     #       31805     -      Ala Leu Ala Val Leu His Ala Thr - # Ile Asn Asn Ser Ile Trp Cys     Gly     #  32005     -      Leu Lys Lys Ser Lys Ala Ser Lys - # Pro Asn Cys Leu Ile Val Val     Gly     # 32150     -      Ser Leu Val Pro Gly Thr Leu Ala - # Val Ser Pro Met Ile Ser Ser     Asn     #               32300 - #                3225     -      Asn Cys Ala Phe Lys Arg Leu Phe - # Lys Thr Leu Asn Pro Pro Thr     Pro     #           32450     -      Leu Ser Lys Thr Pro Ile Phe Met - # Thr Leu Phe Phe Asn Gly Ile     Asn     #       32605     -      Gly Phe Tyr Phe Ser Phe Ile Lys - # Phe Lys Asn Ser Ser Leu Ser     Leu     #  32805     -      Val Cys Cys Ile Leu Glu Thr Lys - # Leu     #   3285     - (2) INFORMATION FOR SEQ ID NO:3:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 3174 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:3: (xi) SEQUENCE DESCRIPTION: SEQ     -      Leu Phe Ser Asp Phe Ala Phe His - # Phe Phe Phe Tyr Glu Val Gln     Phe     #   15     -      Phe Phe Gln Phe Lys Lys Ile Ile - # Arg Lys Thr Ile Gln Asn Arg     Gln     #                 30     -      Asp Ile Ile Phe Lys Ile Ser Asn - # Asn Phe Cys Ser Arg Lys Phe     Cys     #             45     -      Phe Ile Leu Ile Leu Leu Tyr Leu - # Leu Ile His Leu Ser Gln Gly     Leu     #         60     -      Ile Phe Ile Tyr Asp Thr Met Val - # Leu Asp Asn Leu Ile Asn Phe     Phe     #     80     -      Asp Lys Tyr Ser Ile Cys Gly Asn - # Phe Gln Ile Phe Phe Arg Asn     Ala     #   95     -      Ser Gln Ile Lys Thr Ser Asp Asn - # Ala Arg Ala Ser Ile Leu Phe     Val     #                110     -      Glu Asp Phe Pro Ile Ser Leu Ser - # Asp Asp Phe Lys Phe Phe Phe     Cys     #            125     -      Asn Ile Ser Gln Ser Leu Val Ile - # Arg Lys Ile Ala Pro Ile Val     Ser     #        140     -      Gln Lys Asn Leu Cys Arg Ala Leu - # Phe Ser Asn Arg Ile Ser Asn     Arg     #    160     -      Ala Ser Gln Lys Ile Ser Asp Ala - # Leu Ser Arg Lys Ser Asp Asn     Ser     #   175     -      Val Ser Phe Pro Cys Ser Phe His - # Val Ser Tyr Lys Ile Ser Asp     Leu     #                190     -      Trp Ile Ala Lys Tyr Phe Gln Ala - # Ser Ser Ile Ser Lys Thr Asn     Leu     #            205     -      Arg Pro Ser Ile Ala Ser Trp Ile - # Arg Ala Lys Arg Lys Gln Ile     Lys     #        220     -      Gly Glu Thr Ser Leu Leu Ser Lys - # Lys Leu Asp Pro Ser Thr Pro     Ile     #    240     -      Ile Val Phe Glu Lys Ser Lys Arg - # Ser Val Ala Leu Leu Ser Lys     Ser     #   255     -      His Asn Ser Pro Leu Val Asp Leu - # Gln Lys Gly Ala Asn Arg Ala     Thr     #                270     -      Ser Pro Leu Gly Ser Leu Trp Ser - # Phe Pro Lys Ala Leu Ile Ser     Val     #            285     -      Met Gly Ser Arg Phe Ile Phe Pro - # Val Ile Arg Leu Val Thr Ala     Leu     #        300     -      Gln Ala Tyr Asn Leu Leu Gly Lys - # Arg Cys Cys Cys His Phe His     Leu     #    320     -      Phe His Arg Phe Leu Leu Ser Phe - # Ser Gln Ala Leu Gln Gln Lys     Leu     #   335     -      Phe Val Phe Ala Pro Leu Leu His - # Leu Phe Phe Ala His Lys Pro     Pro     #                350     -      Lys Gln Lys Asp Leu Ile Leu Tyr - # Leu Ser Leu Ala Gln Ser His     Thr     #            365     -      Pro His Pro Leu Arg Gln Tyr Ser - # His Lys Arg His Asn Leu Tyr     Pro     #        380     -      Tyr Lys Pro Ala Phe Ala Cys Leu - # Cys His Gln Glu Asn Tyr Arg     Gln     #    400     -      Ile Val Pro Arg Ser Arg Ser Leu - # Thr Thr Leu Ser Tyr Leu Asn     Arg     #   415     -      Leu Ser Gln Val Ser Lys Cys Val - # Leu Leu Lys Leu Asn Gln Asn     Gln     #                430     -      Trp Ala Cys Gly Ser Ser Asp Lys - # Lys Ala Asn Ser Arg Phe Leu     Ala     #            445     -      Ser Ser His Tyr Cys Pro Lys Arg - # Tyr Ser Leu Asp Cys His Gln     Asn     #        460     -      Ser Pro Ser Arg Asn Gln Gln Asn - # Tyr Lys Leu His Gln Lys Lys     Glu     #    480     -      Ala Lys Ser Asn Val Trp Gln Lys - # Ser Asn Phe Pro Arg Lys Ser     Leu     #   495     -      Arg Trp Leu Lys Asn Leu Arg Asp - # Arg Arg Ile Pro Arg Phe Val     Ser     #                510     -      Phe Phe Gln Ser Tyr Gln Gln Thr - # Ala Leu Lys Asn Tyr His Asn     Cys     #            525     -      Leu Ala Tyr His His Tyr Pro Leu - # Ile Arg Ser Ser Asn Leu Lys     Ser     #        540     -      Cys Pro Val Leu Ser Leu His Phe - # Phe Arg Lys Pro Val Cys Pro     Leu     #    560     -      Lys Pro Val Ser Phe Ala Asn Lys - # His Ala Lys Ile Arg Cys Leu     Gly     #   575     -      Arg Phe Tyr Gln Thr Asn Arg Phe - # Val Ile Gly Arg Ile His Ser     Gln     #                590     -      Arg Asn Ile Phe His Ser Asn Arg - # His Leu Arg Phe Phe His Ala     Tyr     #            605     -      Trp Gln Asn Lys Leu Leu Ser Pro - # His His Ala Phe Phe Pro Asp     Thr     #        620     -      Ser Leu Phe Gly Cys Lys Lys Gly - # Leu His Ser Phe Pro Lys Glu     Asn     #    640     -      Cys Lys Ile Arg Ile Val Trp Ser - # Lys Arg His Leu Cys Ile Leu     Leu     #   655     -      Ile Trp Gly Gly Ile Cys Ser Thr - # Ala Leu Leu Ala Leu Ser Tyr     Pro     #                670     -      Ser Phe Gln Ala Trp Leu Ser Phe - # Glu Thr Leu Gln Asn Tyr Cys     Leu     #            685     -      Gln Asn Ala Phe Tyr Ser Arg Val - # His Glu Cys Leu Ile Asn Leu     Ala     #        700     -      Leu Trp Ser Glu Lys Asn Tyr His - # Lys Asn Phe Leu Ser Phe Ala     Ser     #    720     -      His Tyr Arg Leu Ala Asn His Lys - # Lys Asp His Ser Pro Tyr Trp     Phe     #   735     -      His Gln Arg Phe Ser Phe Cys Glu - # Asn Arg Ile Ile Gln Gly Gln     Asn     #                750     -      Ser Lys Arg Tyr Lys Lys Arg Val - # Thr Leu Asp Leu Phe Trp Lys     Trp     #            765     -      Leu Lys Asn Gln Cys Pro Pro Ile - # Leu Cys Leu Lys Glu Asn Ser     Cys     #        780     -      Gly Ser Pro His His Leu Lys Asn - # Ile His Ser Tyr Leu Leu Asp     Arg     #    800     -      Ser Leu Gln Arg Ser Gln Val Cys - # Leu Leu Cys Leu Asp Leu Ser     Gly     #   815     -      Cys His Pro Phe Leu Tyr Ala Ser - # Trp Lys Lys Cys Ser Thr Leu     Cys     #                830     -      Phe Ser Arg Pro Thr Lys Lys Val - # Phe Ile Ser Ser Leu Ala Pro     Phe     #            845     -      Phe Asp Lys Ala Tyr Asn Leu Ser - # Ile Gln Glu Leu Gln Ala Gln     Gln     #        860     -      Ser Leu Gln Ser Lys Gln Val His - # Arg Leu Glu Lys Glu Gln Ile     Ile     #    880     -      Gln Trp Leu Gln Thr Ile Ile Arg - # Asn Lys Lys Phe Gln Gly Asn     Gln     #   895     -      Thr Thr Tyr Ser Pro His Gln Thr - # Arg Leu Lys Arg Asp His Pro     Phe     #                910     -      Ala Cys Leu Leu Val Leu Leu Phe - # Phe Glu Thr Ala Ala Phe Glu     Pro     #            925     -      Leu Ala Leu Leu Ala Phe Phe Val - # His Gln Pro Ala Cys Leu Pro     Gln     #        940     -      Thr Pro Leu Ser Leu Ile Tyr Arg - # Phe Thr Cys Asn Arg Ile Lys     Asp     #    960     -      Phe Phe Glu Arg Leu Tyr Gly Ala - # Asn Met Phe Gln Lys Ile Ile     Arg     #   975     -      Ile Ile Arg Ile Glu Tyr Gly Gln - # Asn Ile Ile Leu Ile Ile Ala     Asn     #                990     -      Phe Ser Cys Phe Ser Ala Gln Gln - # Lys Ala Ala Leu Phe Val Gly     Arg     #           10050     -      Ala Asp Ala Phe Glu Ser Lys Lys - # Val Phe His Phe Arg Ile Asp     Leu     #       10205     -      Phe Tyr Ile Gln Ser His Phe Leu - # Thr Leu Phe Ile Lys Asp Phe     Phe     #  10405     -      Asn Asn Trp Ile Val Met Leu Gln - # Ile Leu Tyr Phe Phe Lys Asp     Ser     # 10550     -      Ile Asp Asn His Glu Val Lys Asn - # Phe Ile Phe Asp Asn Ile Gly     Ile     #               10700 - #                1065     -      Asp Phe Phe Glu Ser Tyr Phe Leu - # Val Phe Gln Ile Phe Asp Ser     Arg     #           10850     -      Phe Glu Thr Ile Lys Ile Ile Cys - # Ile Asn Lys Pro Ile Gly Ser     Val     #       11005     -      Phe Phe Phe Phe Leu Cys Phe Phe - # Phe Gly Phe Phe Val Ile Ile     Cys     #  11205     -      Pro Ile Arg Thr Glu Asn Asn Glu - # Thr Glu Asn Phe Gln Lys Lys     Ala     # 11350     -      Ser Phe Leu Ile Ala Ile Ile Ile - # Ile Val Asp Gln Leu Ser Leu     Leu     #               11500 - #                1145     -      His Pro Arg Gln Tyr Ser Gln Thr - # Cys Ser Asn Arg Tyr Arg Leu     Pro     #           11650     -      Pro Leu Thr Leu Trp Val Ala Arg - # Lys Gln Lys Gln Ser Pro Asp     Leu     #       11805     -      Phe Cys His Gln Arg Phe Ser Lys - # Tyr Pro Leu Cys Leu Pro Leu     Lys     #  12005     -      Arg Asn His Leu Arg Ser Leu Glu - # Gln Lys Ser Glu Pro Asn Leu     Leu     # 12150     -      Gln Arg Leu Leu Asp Trp Arg Pro - # Pro Leu Lys Asp Trp Leu Leu     Pro     #               12300 - #                1225     -      Leu Ala Leu Lys Leu Leu Gln Leu - # Ser Cys Ser Cys Gln Phe Leu     Lys     #           12450     -      Ile Ser Cys Val Ser Phe Gln Ile - # Glu Ile Asn Arg Leu Ser Ile     Ser     #       12605     -      Lys Lys Lys Val Phe Leu Tyr Tyr - # Ser Tyr Lys Arg Tyr Phe Ile     Ile     #  12805     -      Ile Lys Thr Asp Ala Phe Ala Arg - # Gln Leu Thr Ser Phe Arg Asn     Ser     # 12950     -      Lys Pro Thr Cys Pro Asn His Phe - # Ser Phe Ser Ser Arg Cys Arg     Ile     #               13100 - #                1305     -      Val Asp Leu Asp Leu Cys Ile Phe - # Ser Ser Gln Val Glu Thr Lys     Gln     #           13250     -      Ala Lys Thr Asn Pro Ser Ser Glu - # Glu Ser Ser Ile Leu Ser Asn     Ser     #       13405     -      Ile Asp His Arg Asn Phe Phe Gly - # Tyr Leu Lys Ser Val Leu Leu     Ile     #  13605     -      Cys Phe Pro Phe Cys Phe Pro Cys - # Gly Ser Cys Gly Ser Gln Thr     Leu     # 13750     -      Asn Tyr Thr Cys Tyr Ser Lys His - # Asp Thr Gln Thr Lys Leu Phe     Leu     #               13900 - #                1385     -      Glu Arg Phe Met Cys Ser Pro Leu - # Thr Ile Ser Pro Thr Ile Leu     Arg     #           14050     -      Cys Ile Phe Leu Gln Lys Asp Ser - # Phe Leu Ile Ser Cys Ser Tyr     Ser     #       14205     -      Ser Phe Ile Leu Ala Asn Phe Cys - # Leu Trp Val Lys Met Ile Val     Leu     #  14405     -      Ala Phe Arg Arg Leu Gln Arg Ser - # Gly Phe Phe Gln Tyr Leu Met     Lys     # 14550     -      Lys Lys Lys Asn Ala Tyr Cys Cys - # Ile Met Arg Met Phe Lys Asp     Met     #               14700 - #                1465     -      Asn Leu Leu Lys Arg Val Ala Ile - # Phe Ser Ser Leu His Gln Asp     Thr     #           14850     -      Asp Arg Tyr Glu Thr Arg Tyr Ser - # Lys Glu Lys Gln Leu Thr Lys     Pro     #       15005     -      Leu Thr Asn Asn His Lys Pro Lys - # Arg Leu Leu Thr Arg Ser Asn     Leu     #  15205     -      Ser Ile Ile Phe Lys Leu Phe Leu - # Lys Leu Ile Thr Leu Ser Leu     His     # 15350     -      Thr Ile Leu Ile Lys Asn Gln Ser - # Leu Ile Arg Thr Ile Gly Ile     Thr     #               15500 - #                1545     -      Gly Lys Leu Leu Lys Glu Ser Arg - # Asn Gly Lys Asn Thr Pro Ile     Lys     #           15650     -      Arg Ser Lys Ile Leu Pro Lys Arg - # Ile Ser Ile Phe Gln Thr Leu     Ser     #       15805     -      Ile Arg Ala Met Ile Ser Thr Lys - # Thr Ile Ser Leu Met Asn Leu     Pro     #  16005     -      Gln Arg Ala Phe Arg Asn Leu Gly - # Ile Ser Val Thr Glu Phe Ser     Gln     # 16150     -      Val Gly Cys Pro Ile Lys Thr Ile - # Arg Leu Lys Ser Thr Pro Asp     Arg     #               16300 - #                1625     -      Ser Glu Ile Leu Trp Lys Ile Ser - # Tyr Asn Pro Leu Ser Leu Met     Ile     #           16450     -      Lys Arg Lys Arg Ser Phe Asn Leu - # Pro Asn Asn Leu Leu Gln Glu     Ser     #       16605     -      Leu Gly Ile Lys Ser Glu Arg Ile - # Lys Ser Ser Trp Ala Cys Leu     Met     #  16805     -      Ser Pro Lys Lys Gly Lys Lys Gln - # Lys Lys Met Glu Ser Leu Leu     Val     # 16950     -      Gly Ile Gly Trp Ile Phe Phe Ser - # His Leu Tyr Leu Thr Lys Asn     Asn     #               17100 - #                1705     -      Leu Leu Met Ser Lys Lys Gln Ser - # Ile Lys Asn Gln Phe Pro Met     Ser     #           17250     -      Asn Gln Ile Pro Leu Pro Pro Pro - # Thr Tyr Lys Ala Tyr Arg Leu     Lys     #       17405     -      Leu Glu Ile Tyr Leu Met Lys Gly - # Val Ile Phe Leu Asn Ser Leu     Leu     #  17605     -      Ala Ile Trp Lys Cys Met Leu Arg - # Glu Ser Leu Thr Leu Ile Pro     Ile     # 17750     -      Thr Ser Ser Ile Asn Tyr Phe Thr - # Ile Thr Leu Cys Leu Leu Cys     Trp     #               17900 - #                1785     -      Gly Val Ile Met Ala Asn Leu Lys - # Lys Phe His Cys Cys Met Gly     Ala     #           18050     -      Met Val Val Leu Glu Leu Gly Met - # Ile Gly Thr Pro Pro Leu Val     Ile     #       18205     -      Lys Thr Asn Lys Ala Thr Met Trp - # Leu Gln Leu Met Cys Ile Lys     Thr     #  18405     -      Ala Val Ala Ser Gln Val Val Arg - # Lys Gly Leu Thr Thr Leu Val     Phe     # 18550     -      Ile Ser Thr Lys Lys Thr Asn Ser - # Gln Ala His Asn Glu His Val     Lys     #               18700 - #                1865     -      Lys Arg Ser Lys Thr Lys Ile Ser - # Trp Asn Phe Leu His Lys Ile     Met     #           18850     -      Leu Asn Thr Thr Ala Arg Lys Arg - # Arg Lys Asn Ser Glu Leu Arg     Leu     #       19005     -      Lys Ile Ser Lys Lys Thr Leu Arg - # Leu Ile Thr Pro Gly Met Ile     Val     #  19205     -      Leu Leu Leu Phe Leu Lys Lys Thr - # Gln Asn Ile Gln Leu Leu Leu     Ser     # 19350     -      Leu Val Met Gly Ile Ala Thr Leu - # Ser Lys Ile Met Gly Lys Lys     Gln     #               19500 - #                1945     -      Ile Lys Leu Ile Gly Arg Lys Met - # Leu Leu Phe Lys Val Ala Asn     Met     #           19650     -      Met Ala Cys Leu Leu Ile Ile Leu - # Ile Ser Asn Thr Pro Thr Pro     Pro     #       19805     -      Arg Ile Pro Ile Arg Val Ala Leu - # Arg Met Ala Phe Pro Ile Lys     Ala     #  20005     -      Leu Thr Arg Leu Ser Leu Ile Cys - # Leu Ile Ile Ile Ser Leu Ser     Leu     # 20150     -      Val Ser Gly Gly Ile Arg Ile Asn - # Pro Leu Lys Asp Cys Pro His     Lys     #               20300 - #                2025     -      Lys Leu Ile Ser Leu Ser Lys Ile - # Phe Ala Ala Thr Lys Asn Trp     Leu     #           20450     -      Glu Lys Leu Thr Ser Ile Lys Leu - # Leu Thr Leu Lys Thr Gln Ala     Ile     #       20605     -      Met Met Lys Lys Lys Leu Arg Lys - # Ile Leu Lys Asn Leu Gly Asn     Glu     #  20805     -      Ser Ile Arg Lys Lys Arg Lys Asn - # Trp Arg Ala Lys Ala Ala Thr     Lys     # 20950     -      Ile Lys Trp Lys Gln Lys Leu Lys - # Leu Thr Ala Lys Lys Met Arg     Phe     #               21100 - #                2105     -      Leu Arg Ser Ile Lys Arg Leu Ile - # Glu Thr Gln Glu Gln Ser Leu     Thr     #           21250     -      Leu Arg Ile Leu Lys Ala Ser Lys - # Gly Asn Cys Leu Ile Asn Leu     Lys     #       21405     -      Met Ser Thr Arg Ile Lys Thr Leu - # Ile Asn Leu Leu Met Asn Ser     Lys     #  21605     -      Met Ala Lys Ile Arg Ile Ser Ala - # Arg Gln Lys Lys His Lys Pro     Leu     # 21750     -      Lys Val Arg Lys Ile Val Ser Ile - # Gln Asn Gly Phe Gln Lys Leu     Lys     #               21900 - #                2185     -      Thr Leu Met Gln Leu Met Asn Ser - # Lys Met Ala Lys Ile Arg Ile     Ser     #           22050     -      Ala Arg Arg Lys Gln Lys Ala Thr - # Leu Lys Ile Pro Leu Lys Met     Ser     #       22205     -      Ser Ile Lys Arg Arg Ile Lys Leu - # Ile Ile Ser Ile Lys Arg Tyr     Gln     #  22405     -      Trp Leu Lys Gln Arg Val Ile Ser - # Val Gly Ser Lys Arg Pro Ile     Ser     # 22550     -      Lys Ile Ser Gln Arg Ser Asn Trp - # Pro Asn Lys Leu Lys Lys Met     Lys     #               22700 - #                2265     -      Val Ser Met Leu Glu Lys Asn Leu - # Lys Tyr Ile Asn Pro Leu Arg     Met     #           22850     -      Val Met Glu Pro Ser Val Met Gly - # Tyr Leu Lys Gln Lys Pro Gln     Leu     #       23005     -      Phe Leu Lys Thr Phe Arg Thr Ser - # Arg Lys Ser Met Gln Asn Leu     Glu     #  23205     -      Ile Ser Ile Thr Ile Thr Ile Met - # Asp Ser Lys Thr Asn Pro Phe     Met     # 23350     -      Leu Lys Leu Ile Lys Arg Lys Gln - # Gly Lys Gln Leu Ala Leu Lys     Asn     #               23500 - #                2345     -      Pro Phe Thr Leu Lys Leu Leu Lys - # Arg Met Gln Lys Leu Thr Asp     Ser     #           23650     -      Ile Lys Gln Val Val Trp Val Leu - # Gly Lys Gln Arg Ala Ser Leu     Lys     #       23805     -      Gly Met Ile Lys Leu Met Ile Ser - # Val Arg Gly Phe Gln Gly Ile     Lys     #  24005     -      Asn Trp Leu Arg Lys Leu Thr Ile - # Ser Ile Lys Arg Tyr Gln Lys     Leu     # 24150     -      Lys Gln Val Phe Leu Ala Ile Ser - # Lys Arg Thr Ser Ser Lys Ile     Leu     #               24300 - #                2425     -      Gln Asn Thr Ile Pro Ile Tyr Gly - # Leu Lys Val Gln Lys Lys Tyr     Leu     #           24450     -      Leu Val Cys Gln Arg Asn Thr Ile - # Thr Leu Leu Thr Ala Thr Tyr     Ala     #       24605     -      Leu Ile Ala Ile Ser Lys Met Glu - # Gln Ser Met Lys Lys Arg Pro     Ala     #  24805     -      Cys Arg Lys Lys Thr Leu Ser Gly - # Ser Ser Ser Met Ile Arg Leu     Arg     # 24950     -      Ile Met Glu Ala Phe Leu Cys Gln - # Ser Met Ile Lys Leu Ala Ser     Thr     #               25100 - #                2505     -      Arg Arg Ile Lys Ile Ile Leu Ile - # Arg Ser Ser Phe Pro Pro Ser     Thr     #           25250     -      Met Leu Lys Thr Leu Ile Leu Ala - # Leu Arg Asn Phe Pro Met His     Phe     #       25405     -      Leu Gln His Leu Ile Thr Ala Trp - # Arg Glu Lys Met Arg Ser Met     Glu     #  25605     -      Ser Arg Thr Leu Ile Gln Lys Val - # Val Ser Lys Asn Leu Lys Gly     Leu     # 25750     -      Arg Asn Thr Lys Asn Ala Lys Thr - # Thr Pro Cys Lys Arg Gly Val     Phe     #               25900 - #                2585     -      Tyr Ser Leu Ala Glu Ile Pro Ile - # Val Phe Ser Ile Trp Asp Glu     Cys     #           26050     -      Tyr Gln Phe Met Val Ser Tyr Pro - # His Thr Phe Val Ser Ser Val     Gly     #       26205     -      Ser Val Gln Ser Tyr Ala Phe Gly - # Asp Met Met Cys Glu Thr Cys     Arg     #  26405     -      Glu Cys Val Gly Ala Gln Thr Leu - # Asn Pro Tyr Tyr Arg Asp Thr     Glu     # 26550     -      Glu Pro Asn Ser Pro Tyr Gly Gln - # His Gln Pro Arg Lys Pro Asn     Arg     #               26700 - #                2665     -      Leu Arg Leu Gly Thr Ser Pro Asn - # Ile Pro Thr Asp Thr Asn Glu     Arg     #           26850     -      Leu Cys Ser Leu Lys Ser Ala Trp - # Ile Phe Pro Thr Pro Lys Arg     Leu     #       27005     -      Asn Pro Leu Leu Lys Ile Lys Phe - # Asp Cys Ala Ser Gly Phe Val     Leu     #  27205     -      Cys Glu Asn Leu Arg Val Ile Lys - # Arg Ala Thr Arg Lys Asn Lys     Leu     # 27350     -      Gln Arg Ser Ser Ser Lys Asn His - # Arg Ala Phe Arg Ala Asn Ser     Arg     #               27500 - #                2745     -      Ser Pro Lys Lys Asn Gln Lys Asn - # His Arg Asn Tyr His Thr Leu     Cys     #           27650     -      Pro Lys Lys Thr Leu Ala Ile Arg - # Ser Gly Lys Ile Arg Leu Gln     Glu     #       27805     -      Phe Ala Gln Asn Arg His Cys Gly - # Cys Leu Ser Arg Arg Gly Arg     Tyr     #  28005     -      Tyr Tyr Leu Phe His Arg Glu Asn - # Leu Trp Ser Phe Phe Leu Ala     Arg     # 28150     -      Phe Thr Leu Glu Cys Gly Tyr Phe - # Gly Lys Arg Lys Ser His Phe     Cys     #               28300 - #                2825     -      Gly Lys Val Phe Leu Glu Phe Thr - # Lys Arg Glu Glu Tyr Leu Arg     Tyr     #           28450     -      Phe Ala Ser Leu Ile Gly Asn Leu - # Phe Gln Arg Ser Leu Asp Ile     Leu     #       28605     -      Arg Phe Lys Thr Leu Ser Ile Ser - # Leu Lys Arg Phe Ile Ile Asn     Thr     #  28805     -      Ile Pro Leu Cys Glu Leu Pro Leu - # Phe Gly Asn Val Ile Leu Thr     Leu     # 28950     -      Asn Phe Tyr Arg Tyr Asn Leu Ser - # His Ser Leu Ala Cys Phe Ser     Ser     #               29100 - #                2905     -      Gln Ile Thr Ser Pro Leu Ala Asn - # Ser Thr Leu Gly Asn Ala Cys     Ala     #           29250     -      Phe Phe Leu Arg Ala Tyr Phe Cys - # Cys Lys Tyr Pro Thr Ile Ala     Ser     #       29405     -      Pro Glu Trp Met Ser Arg Gly Gly - # Val Glu Arg Ala Lys Cys Ser     Ile     #  29605     -      Lys Pro Ser Ile Phe Ala Ile Lys - # Gly Lys Cys Val Gln Pro Lys     Ile     # 29750     -      Ile Thr Ser Gly Lys Ser Leu Arg - # Glu Asn Asn Asn Ala Cys Lys     Phe     #               29900 - #                2985     -      Leu Thr Ile Arg Pro Leu Lys Tyr - # Phe Leu Gln Ser Lys Ala Gln     Lys     #           30050     -      Glu Lys Trp Leu Asn Ala Lys His - # Ser Asn Ser Leu Val Val Ser     Gly     #       30205     -      His Cys His Lys Arg Trp Ile Gly - # Ser Ser Leu Leu Ser Leu Ala     Leu     #  30405     -      Lys Gly Arg Phe Tyr Leu Leu Leu - # Val Ala Ser Leu Lys Cys Leu     Ala     # 30550     -      Gln Ser Arg Pro Gln Gly Phe Trp - # Asn Ala Phe Ala Ser Leu Leu     Lys     #               30700 - #                3065     -      Leu Glu Arg Ser Leu Cys Cys Met - # Pro Gln Ser Ile Ile Gln Ser     Gly     #           30850     -      Ala Val Lys Asn Pro Lys Pro Leu - # Ser Gln Ile Ala Ser Trp Gly     Leu     #       31005     -      Cys His Lys Ala Leu Pro Tyr Arg - # His Asn Arg Phe His Gln Ile     Ile     #  31205     -      Ala Leu Leu Lys Gly Phe Leu Lys - # Arg Thr Leu Pro His Arg Tyr     Gln     # 31350     -      Lys Arg Leu Phe Ser His Phe Phe - # Asn Leu Met Gly Leu Ile Arg     Asp     #               31500 - #                3145     -      Phe Ile Phe His Ser Leu Ser Leu - # Lys Ile Leu His Cys Pro Phe     Val     #           31650     -      Ala Phe Asn Arg Gln Ser              3170     - (2) INFORMATION FOR SEQ ID NO:4:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 3177 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:4: (xi) SEQUENCE DESCRIPTION: SEQ     -      Cys Ser Val Ile Ser Pro Ser Ile - # Ser Ser Ser Met Lys Ser Asn     Ser     #   15     -      Ser Phe Ser Ser Lys Arg Leu Glu - # Lys Leu Ser Lys Ile Val Lys     Thr     #                 30     -      Ser Phe Ser Lys Phe Pro Ile Ile - # Phe Val His Ala Asn Phe Val     Ser     #             45     -      Phe Tyr Ser Ser Ile Cys Tyr Ile - # Cys Leu Lys Ala Tyr Leu Ser     Met     #         60     -      Ile Leu Trp Phe Trp Ile Ile Leu - # Ser Ile Ser Leu Thr Asn Thr     Val     #     80     -      Ser Val Asp Lys Ile Phe Lys Tyr - # Ser Leu Gly Met Pro Leu Lys     Leu     #   95     -      Lys Leu Ala Ile Thr Leu Gly Leu - # Pro Ser Cys Leu Arg Ile Phe     Leu     #                110     -      Ser Ser Pro Leu Val Met Ile Ser - # Asn Ser Phe Ser Val Thr Leu     Ala     #            125     -      Asn Leu Trp Ser Glu Arg Leu Pro - # Pro Ser Phe Leu Lys Lys Ile     Phe     #        140     -      Val Gly His Cys Ser Leu Ile Val - # Ser Ala Ile Gly Gln Ala Lys     Arg     #    160     -      Ser Val Met Leu Val Ala Ser Leu - # Thr Ile Ala Phe Leu Phe Leu     Ala     #   175     -      Val Phe Ser Met Ser Leu Thr Lys - # Ala Thr Phe Gly Ser Leu Asn     Ile     #                190     -      Ser Arg Leu His Gln Tyr Leu Arg - # Gln Ile Tyr Ala His Pro Leu     Pro     #            205     -      Leu Gly Tyr Glu Arg Lys Gly Lys - # Asn Lys Arg Ala Lys His His     Tyr     #        220     -      Cys Leu Arg Asn Leu Thr His Gln - # Arg Gln Ser Phe Leu Lys Asn     Leu     #    240     -      Ser Asp Leu Leu Leu Tyr Tyr Gln - # Lys Ala Ile Glu Ile His His     Trp     #   255     -      Leu Ile Cys Lys Lys Ala Leu Ile - # Ala Arg Gln Ala His Asp His     Cys     #                270     -      Gly Leu Ser Arg Lys His Val Glu - # Trp Asp Asn Leu Asp Ser Tyr     Phe     #            285     -      Leu Gly Trp Leu Leu Arg Cys Lys - # Arg Ile Arg Ile Cys Ala Lys     Arg     #        300     -      Asp Ala Val Ala Ile Phe Ile Phe - # Phe Ile Ala Phe Ser Cys Val     Phe     #    320     -      His Lys Leu Phe Asp Asn Arg Asn - # Ser Cys Leu Phe Leu Leu His     Cys     #   335     -      Cys Ile Cys Phe Leu His Thr Ser - # Arg Pro Ser Lys Arg Ile Ser     Cys     #                350     -      Ile Cys Pro Leu Asn Leu Asp Ile - # Leu Pro Thr His Cys Asp Asn     Ile     #            365     -      Pro Ile Ser Ala Ile Ile Phe Ile - # His Ile Asn His Ser Glu Leu     Leu     #        380     -      Leu Val Phe Ala Asp Ile Ser Arg - # Lys Ile Ile Gly Lys Leu Ser     His     #    400     -      Ser Val Gln Gly His His Lys His - # Cys Leu Thr Thr Gly Ala Lys     Tyr     #   415     -      Gln Ser Val Ser Cys Ser Thr Lys - # Ile Ser Gly His Val Glu Val     Gln     #                430     -      Ile Lys Arg Arg Ile Asp Leu Ala - # Phe Arg His His Thr Ile Ala     Pro     #            445     -      Ser Val Ile Leu Leu Ile Ala Ile - # Lys Thr His Ser Pro His Ser     Asp     #        460     -      Glu Ile Ser Lys Ile Ile Arg Ser - # Tyr Ile Lys Asn Lys Arg Lys     Arg     #    480     -      Asp Lys Arg Ala Met Phe Gly Lys - # Lys Val Ile Phe His Arg Glu     Ser     #   495     -      Arg Cys Val Gly Lys Thr Phe Ser - # Val Lys Thr Arg Asp Ala Phe     Leu     #                510     -      Asn Ala Leu Phe His Phe Phe Lys - # Ala Ile Ser Lys Gln His Arg     Ile     #            525     -      Thr Ile Thr Ala Arg Asn Ile Thr - # Ile Thr His Phe Ala Gln Ala     Thr     #        540     -      Ser His Ala Leu Phe Phe Arg Ala - # Ser Thr Asn Asn Asn Ser Ser     Arg     #    560     -      Gly Glu Ser Gln Phe Val Leu Asn - # Leu Val Phe Arg Arg Ile Ser     Met     #   575     -      His Glu Asn Lys Phe Ala Val Gly - # Val Asp Phe Ile Lys Gln Ile     Ala     #                590     -      Leu Phe Arg Gly Val Phe Thr Asn - # Leu Asn Ser Ala Lys Ile Phe     Phe     #            605     -      Thr Gln Ile Val Ile Asp Phe Ser - # Met Leu Ile Gly Lys Ile Asn     Phe     #        620     -      Phe His Pro Thr Met His Phe Phe - # Leu Glu Ile Gln Val Phe Ser     Val     #    640     -      Ala Asn Lys Arg Gly Phe Ile His - # Ser Gln Lys Lys Ile Val Lys     Phe     #   655     -      Ala Leu Phe Gly Val Asn Asn Ala - # Ile Phe Ser Val Phe Phe Ser     Ser     #                670     -      Gly Val Val Phe Ala Gln Gln Leu - # Phe Tyr Glu Leu Tyr Pro Ile     Leu     #            685     -      Glu Val Phe Lys Leu Gly Ser Asn - # Asn Pro Leu Lys Leu Phe Lys     Ile     #        700     -      Ile Val Cys Lys Met Leu Phe Ile - # His Asn Gly Phe Ile Glu Ser     Val     #    720     -      Ile Leu Leu Tyr Gly Gln Lys Lys - # Thr Asn Ile Ile Lys Thr Phe     Ser     #   735     -      Leu Leu Leu Leu Ile Ile Gly Ser - # His Glu Ile Ile Arg Lys Asn     Lys     #                750     -      Thr Ile Val Pro Ile Gly Phe Ile - # Asn Asn Asp Phe Leu Phe Asn     Ser     #            765     -      Val Lys Ile Glu Phe Glu Phe Arg - # Gly Asn Lys Ile Pro Lys Asn     Ala     #        780     -      Ile Asn Arg Ser Val Leu Tyr Glu - # Ile Phe Phe Gly Ser Gly Lys     Ile     #    800     -      Asn Ala Pro Gln Phe Asp Cys Ala - # Lys Arg Ile Val Val Ser Lys     Asp     #   815     -      His Pro Ile Ile Lys Thr Phe Ile - # Val Ile Cys Ile Gly Val Cys     Ser     #                830     -      Ala His Lys Ser Val Phe Ser Val - # Trp Ile Val Asp Val Ile Ser     His     #            845     -      Phe Phe Thr Arg Leu His Gly Lys - # Asn Ala His Glu Arg Phe Val     Phe     #        860     -      His Ala Leu Lys Gln Lys Arg Phe - # Phe Lys Phe His Arg Ser Leu     His     #    880     -      Ser Leu Ile Lys Pro Ile Ile Phe - # Leu Glu Ser Lys Ser Tyr Lys     His     #   895     -      Asn Asn Asn Arg Tyr Asn Gln Asn - # Arg Phe Ile Gly Leu Lys Lys     Asn     #                910     -      Asn Arg Tyr Asn Gly Tyr Lys Gln - # Tyr Lys Tyr Arg Gly Ile Asn     Lys     #            925     -      Ser Phe Arg Glu Thr Lys Gln Pro - # Ile Pro Pro Ile Lys Gln Asp     Phe     #        940     -      Lys Lys Gly Ile Asp Thr Leu Leu - # His Val Cys Phe Lys Phe Phe     Tyr     #    960     -      Phe Leu Lys Leu Pro Leu Leu Asn - # Leu Leu Leu Tyr Asn Leu Phe     Leu     #   975     -      Phe Ile Ser Leu Leu Ala Cys His - # Lys His Leu Ser Leu Tyr Arg     Tyr     #                990     -      Thr Ala Ser His Val Ile Val Lys - # Ile Phe Leu Arg Asp Ser Thr     Val     #           10050     -      Leu Ile Cys Phe Lys Arg Ser Leu - # Gly Ser Glu Leu Asn Thr Ala     Asn     #       10205     -      Lys Thr Leu Tyr Asn Leu Ser Ser - # His Arg Ile Ser Leu Val Ser     Pro     #  10405     -      Arg Asn Asp Arg Lys Gln Arg Cys - # Leu Leu Val Val Leu Met Leu     Leu     # 10550     -      Lys Val Lys Lys Ser Phe Thr Ser - # Gly Leu Ile Cys Asn Ser Thr     Phe     #               10700 - #                1065     -      Asn Pro Ile Ser Leu Pro Phe Ser - # Ser Lys Ile Phe Ser Ile Thr     Gly     #           10850     -      Ser Cys Phe Lys Ser Phe Ile Phe - # Leu Arg Thr Leu Leu Thr Ile     Thr     #       11005     -      Lys Ser Lys Thr Ser Ser Leu Ile - # Ile Ser Gly Leu Thr Ser Leu     Lys     #  11205     -      Val Thr Phe Leu Ser Phe Lys Phe - # Leu Ile Val Ala Leu Lys Leu     Ser     # 11350     -      Lys Ser Lys Phe Val Thr Ser Pro - # Leu Gly Val Phe Phe Ser Phe     Ser     #               11500 - #                1145     -      Cys Ala Ser Phe Leu Ala Ser Leu - # Ser Ser Phe Ala Asn Pro Tyr     Glu     #           11650     -      Leu Lys Thr Thr Arg Leu Lys Arg - # Thr Phe Lys Lys Lys Pro Leu     Ser     #       11805     -      Phe Leu Leu Leu Leu Leu Leu Leu - # Leu Ile Asn Leu Ala Ser Ser     Ser     #  12005     -      Thr Leu Ala Asn Ile Glu Ala Lys - # Leu Ser Ala Gln Ile Asp Asn     Thr     # 12150     -      Ala Cys His Arg His Ser Tyr Gly - # Leu Arg Val Ser Lys Asn Ser     Arg     #               12300 - #                1225     -      Leu Thr Ser Phe Asn Val Ile Arg - # Asp Phe Pro Asn Ile Arg Tyr     Ala     #           12450     -      Phe Asp Pro Ser Ala Thr Thr Tyr - # Asp Arg Asn Arg Asn Asp Leu     Asn     #       12605     -      Asn Gln Thr Phe Ser Cys Ser Asp - # Ala Phe Cys Arg Thr Gly Asp     Arg     #  12805     -      His Arg Ile Gly Cys Tyr His Pro - # Ser Tyr Tyr Asn Phe Leu Val     Ala     # 12950     -      Val Ser Asp Ser Cys Lys Lys Phe - # His Ala Phe Pro Phe Lys Leu     Lys     #               13100 - #                1305     -      Ser Ile Ala Val Tyr Gln Lys Lys - # Lys Tyr Phe Tyr Thr Ile His     Thr     #           13250     -      Ser Ala Thr Leu Phe Lys Ser Lys - # Pro Thr Leu Leu Leu Gly Asn     His     #       13405     -      His Ser Gly Ile Val Asn Leu Leu - # Val Pro Thr Ile Phe Leu Ser     Gln     #  13605     -      Val Val Val Glu Leu Ile Phe Arg - # Ile Phe Asp Val Phe Phe Asn     Arg     # 13750     -      Leu Arg Leu Lys Pro Lys Asn Lys - # Gln Lys Gln Thr Gln Ala Asp     Gln     #               13900 - #                1385     -      Ser Glu Asn Lys Ala Pro Phe Ala - # Thr Pro Thr Thr Lys Glu Thr     Phe     #           14050     -      Phe Glu Ala Ile Phe Glu Asn Leu - # Ser Tyr Phe Val Phe His Phe     Val     #       14205     -      Ser His Val Asp Leu Val Asp His - # Lys Arg Leu Ile Ile His Ala     Ile     #  14405     -      Val Ser Met Thr His Lys Pro Asn - # Tyr Phe Asn Ala Ser Cys Ala     His     # 14550     -      Leu Asp Pro Phe Leu Gln Pro Tyr - # Phe Ser Val Ala Phe Asp Phe     Phe     #               14700 - #                1465     -      Lys Lys Ile His Phe Leu Phe Leu - # Val Leu Ile Lys Val Leu Ser     Phe     #           14850     -      Gln Ile Phe Val Asn Cys Gly Lys - # Cys Glu Ser Ser Pro Leu Asp     Ala     #       15005     -      Cys Asn Asp Arg Ala Phe Phe Asn - # Ile Asn Asn Asp Lys Lys Lys     Lys     #  15205     -      Met Leu Asp Ile Val Val Glu Cys - # Ser Lys Thr Ile Asp Tyr Ser     Ser     # 15350     -      Val Arg Phe Leu Ala Val Phe Asp - # Thr Asn Lys Ile Pro Ile Gly     Met     #               15500 - #                1545     -      Lys Leu Gly Ile Val Arg Arg Asn - # Asn Asp Arg Asn His Pro Thr     Thr     #           15650     -      Thr Asn Arg Ser Gly Phe Pro Ala - # Ala Ile Tyr Gln Ser Ser Ser     Ser     #       15805     -      Phe Ser Ser Arg Cys Arg Phe Ile - # Arg Ser Ser Lys Thr Asn Arg     Glu     #  16005     -      Arg Gly Gln Ala Ser Phe Arg Asn - # Leu Ala Ile Lys Gly Arg Ile     Leu     # 16150     -      Gln Ser Asp Gln Lys Ser Tyr Gln - # Lys Glu Ser Val Phe Phe Arg     Leu     #               16300 - #                1625     -      Tyr Gln Glu Gln Phe Asn Gln Gln - # Arg Gln Ser His Cys Arg Ile     Phe     #           16450     -      His Lys Glu Leu Ser Glu Ile Trp - # Gly Ser Ala Leu Pro Asn Phe     His     #       16605     -      Lys Leu Gly Val Pro Ser Lys Arg - # Ser Val Asn Gln His Pro Ile     Asp     #  16805     -      Pro Lys Phe Tyr Gly Lys Tyr His - # Thr Thr Pro Tyr Pro Arg Glu     Ser     # 16950     -      Gly Val Phe Glu Ile Cys Gln Thr - # Ile Phe Cys Arg Asn His Tyr     Arg     #               17100 - #                1705     -      Glu Ser Asn Pro Asn Gly Ser Lys - # Val His Gly Arg Val Val Leu     Glu     #           17250     -      Arg Lys Ala Arg Ser Arg Lys Lys - # Trp Arg Ala Tyr Trp Trp Gly     Leu     #       17405     -      Val Gly Tyr Phe Ser Leu Ile Tyr - # Ile Gln Lys Thr Ile Phe Cys     Gln     #  17605     -      Arg Ser Asn Gln Ser Arg Thr Ser - # Ser Pro Cys Pro Thr Arg Tyr     Ser     # 17750     -      His Tyr His His Arg His Thr Arg - # Leu Thr Ala Ser Arg Phe Thr     Lys     #               17900 - #                1785     -      Gly Phe Phe Ile His Ser Trp Arg - # Tyr Gly Asn Val Arg Cys Gly     Ser     #           18050     -      Arg His Ser Gln Leu Gln Val Gln - # Ser Ile Ile Asp Ser Gln Arg     Ser     #       18205     -      Val Phe Cys Val Asn Gly Glu Ser - # Trp His Arg Thr Lys Ser Phe     Ile     #  18405     -      Val Val Trp Gly Gln Trp Trp Ser - # Trp Ser Ala Leu Glu Arg His     Arg     # 18550     -      Trp Leu Arg Pro Thr Arg Gln Gln - # Cys Gly Tyr Asn Asn Cys Ala     Tyr     #               18700 - #                1865     -      Glu Lys Arg Gln Trp Leu Ser His - # Ser Arg Trp Glu Arg Asp Gln     Pro     #           18850     -      Phe Leu Ser Leu Gln Arg Arg Pro - # Thr His Arg Leu Thr Thr Ser     Ile     #       19005     -      Lys Ser Arg Arg Asp Pro Lys Gln - # Asn Arg Phe His Gly Ile Ser     Cys     #  19205     -      Thr Lys Cys Ile Arg Gln Leu Glu - # Arg Glu Arg Glu Gly Lys Ile     Pro     # 19350     -      Asn Asp Arg Phe Pro Lys Arg Leu - # Gly Leu Phe Arg Arg Pro Arg     Glu     #               19500 - #                1945     -      Ser Tyr Cys Phe Cys Phe Lys Arg - # His Lys Thr Phe Ser Phe Asn     Tyr     #           19650     -      Val Trp Trp Gly Phe Glu Leu His - # Ser Gln Arg Leu Trp Glu Lys     Ser     #       19805     -      Arg Ser Phe Arg Gly Glu Lys Cys - # Tyr Ser Ser Arg Pro Lys Thr     Trp     #  20005     -      Arg Asp Val Cys Leu Phe Phe Gln - # Ile His Gln Arg Leu Gln Glu     Ser     # 20150     -      Gln Gly Cys Arg Arg Tyr Glu Trp - # Arg Phe Pro Phe Arg Ser Arg     Leu     #               20300 - #                2025     -      Gln Gly Ser Tyr Leu Phe Ala Phe - # Lys Ser Arg Tyr His Phe Arg     Lys     #           20450     -      Ala Glu Phe Arg Gly Thr Asn His - # Arg Ile Val Pro Thr Arg Ser     Ala     #       20605     -      Tyr Gln Arg Phe Phe Glu Gln Gln - # Gln Arg Ile Gly Trp Lys Asn     Phe     #  20805     -      Lys Leu Gln Ser Cys Ser Arg Lys - # His Arg Gln Leu Ser Glu Lys     Ser     # 20950     -      Ser Glu Arg Ser Lys Ile Ser Lys - # Glu Thr Arg Ala Phe Arg Glu     Arg     #               21100 - #                2105     -      Ser Arg Glu Lys Ile Gly Glu Gln - # Lys Arg Gln Gln Lys Asn Gly     Ser     #           21250     -      Lys Ser Ser Ser Gln Pro Lys Arg - # Asp Phe Cys Val Asp Gln Arg     Gly     #       21405     -      Arg Arg Lys Ser Asn Arg Leu Arg - # Ser Glu Ser Arg His Gln Lys     Gly     #  21605     -      Ile Val Thr Lys Cys Gln Gln Glu - # Phe Glu Arg Leu Ile Phe Ile     Gln     # 21750     -      Lys Trp Gln Lys Gly Phe Gln Gln - # Gly Arg Arg Asn Thr Lys Ser     Pro     #               21900 - #                2185     -      Arg Phe Gly Glu Arg Phe Arg Tyr - # Gln Ser Arg Met Asp Phe Lys     Ser     #           22050     -      Lys Pro Cys Ser Phe Glu Ile Gln - # Lys Trp Gln Lys Gly Phe Gln     Gln     #       22205     -      Gly Asn Ala Ser Lys Lys Arg Pro - # Lys Phe Arg Arg Cys Asp His     Gln     #  22405     -      Ser Lys Gly Asn Gly Ser Ser Gln - # Ser Ser Gly Ile Ser Gly Ser     Asn     # 22550     -      Gly Phe Gln Gly Arg Ala Ser Val - # Ser Arg Ser Gln Lys Phe Leu     Lys     #               22700 - #                2265     -      Gly Ala Ile Gly Pro Thr Ser Ser - # Lys Lys Lys Ser Gln Cys Lys     Lys     #           22850     -      Ile Asn Ile Ser Ile Arg Glu Trp - # Cys Glu Trp Asn Pro Ser Arg     Trp     #       23005     -      Val Ile Ser Ser Arg Ser His Asn - # Ser Phe Lys Leu Phe Gly His     Gln     #  23205     -      Glu Arg Val Glu Cys Lys Thr Trp - # Lys Phe Gln Gln Gln Trp Thr     Gln     # 23350     -      Lys Arg Thr His Leu Cys Ser Lys - # Glu Ser Arg Ala Ser Ser Pro     Arg     #               23500 - #                2345     -      Thr His Leu Arg Ser Ser Cys Lys - # Gly Lys Cys Lys Asn Pro Thr     Gln     #           23650     -      Ser Asn Ser Lys Trp Phe Gly Cys - # Cys Arg Ala Ser Ser Gly Leu     Pro     #       23805     -      Phe Glu Lys Ala Ser Ser Gln Gly - # Arg Ala Phe Lys Glu Ser Arg     Ile     #  24005     -      Gly Ser Glu Asn Gln Ser Gln Ser - # Ser Gly Ile Arg Ser Ser Arg     Phe     # 24150     -      Phe Trp Gln Ser Arg Ala Asn Asp - # Arg Gln Ala Gln Arg Phe Tyr     Lys     #               24300 - #                2425     -      Thr Gln Ser His Glu Ser Met Gly - # Lys Cys Lys Lys Ser Thr Cys     Phe     #           24450     -      Val Ser Glu Thr Arg Gln Leu Arg - # Tyr Gln Pro His Thr His Gln     Tyr     #       24605     -      Gln Lys Trp Ser Asn Gln Lys Ser - # Asp Arg His Ala Asn Ala Lys     Lys     #  24805     -      Pro Val Ala Gln Ala Arg Glu Asp - # Ser Cys Ala Cys Arg Lys Arg     Ser     # 24950     -      Phe Val Arg Val Asn Trp Leu Gln - # Pro Glu Glu Tyr Glu Arg Leu     Phe     #               25100 - #                2505     -      Phe Val Gln Val Phe His Gln Val - # Glu Gln Cys Cys Lys Arg His     Phe     #           25250     -      Trp Leu Tyr Ala Ile Phe Asn Gln - # Cys Ile Phe Tyr Ser Ile Leu     Leu     #       25405     -      Leu Leu Gly Glu Arg Lys Cys Gly - # Ala Trp Asn Gln Glu Arg Tyr     Lys     #  25605     -      Arg Trp Phe Pro Lys Ile Leu Lys - # Asp Gly Ile Pro Lys Thr Gln     Lys     # 25750     -      Pro Pro Leu Ala Lys Ser Glu Gly - # Phe Phe Asn Thr Pro Gln Lys     Ser     #               25900 - #                2585     -      Gln Ser Ser Leu Val Phe Gly Met - # Asn Ala Thr Asn Ser Trp Tyr     His     #           26050     -      Ile Pro Ile His Ser Tyr Leu Ala - # Glu Val Cys Lys Val Thr Pro     Leu     #       26205     -      Glu Ile Cys Val Arg Pro Val Gly - # Asn Ala Leu Glu Leu Lys Leu     Cys     #  26405     -      Lys Ile Pro Ile Ile Gly Thr Gln - # Ser Glu Asn Gln Thr Leu Pro     Thr     # 26550     -      Gly Asn Ile Ser Leu Gly Ser Pro - # Ile Val Phe Ser Gly Trp Ala     Leu     #               26700 - #                2665     -      His Leu Lys Ile Ser Arg Gln Thr - # Leu Thr Lys Gly Phe Val Leu     Ser     #           26850     -      Leu His Gly Tyr Phe Leu Pro Gln - # Lys Asp Leu Thr Leu Cys Leu     Lys     #       27005     -      Leu Ser Leu Ile Val Leu Val Gly - # Ser Cys Tyr Ser Ala Lys Ile     Asn     #  27205     -      Gly Leu Arg Glu His Lys Leu Glu - # Lys Thr Ser Ser Tyr Asn Lys     Asp     # 27350     -      Gln Val Gln Lys Ile Ile Glu Leu - # Leu Glu Gln Ile Asp Arg Ala     Leu     #               27500 - #                2745     -      Asn Gln Arg Lys Ile Arg Lys Thr - # Ile Gly Ile Ile Thr Pro Tyr     Asn     #           27650     -      Ala Gln Lys Arg Arg Leu Arg Ser - # Glu Val Glu Lys Tyr Gly Phe     Lys     #       27805     -      Asn Phe Asp Glu Leu Lys Ile Asp - # Thr Val Asp Ala Phe Gln Gly     Glu     #  28005     -      Glu Ala Asp Ile Ile Ile Tyr Ser - # Thr Val Lys Thr Cys Gly Asn     Leu     # 28150     -      Ser Phe Leu Leu Asp Ser Lys Arg - # Leu Asn Val Ala Ile Ser Arg     Ala     #               28300 - #                2825     -      Lys Glu Asn Leu Ile Phe Val Gly - # Lys Lys Ser Phe Phe Glu Asn     Leu     #           28450     -      Arg Ser Asp Glu Lys Asn Ile Phe - # Ser Ala Ile Leu Gln Val Cys     Arg     #       28605     -      Val Ile Phe Ser Lys Asp Asn His - # Thr Phe Phe Ala Ser Lys Arg     Phe     #  28805     -      His Lys Ser Leu Ser Ala Leu Ser - # Thr Gln Tyr Pro Tyr Ser Val     Ser     # 28950     -      Tyr Ser Pro Phe Leu Gly Ile Glu - # Leu Phe Leu Ile Phe Ile Ser     Val     #               29100 - #                2905     -      Thr Ile Ala Ile Leu Leu Val Phe - # Leu Ala Arg Ser His Arg Arg     Ser     #           29250     -      His Glu Ile Pro Leu Gly Met Arg - # Val His Phe Phe Gly Arg Ile     Phe     #       29405     -      Ala Ala Asn Ile Leu Gln His Arg - # Pro Asn Gly Val Gly Gly Val     Leu     #  29605     -      Lys Gly Gln Asn Ala Pro Asn Ser - # Pro Gln Phe Phe Glu Arg Leu     Arg     # 29750     -      Glu Asn Ala Cys Asn Leu Lys Ser - # Leu Arg Glu Asn Leu Gly Ser     Glu     #               29900 - #                2985     -      Ile Ile Thr His Ala Ser Phe Gln - # Phe Ala Leu Asn Thr Phe Phe     Asn     #           30050     -      Gln Arg His Lys Lys Arg Ser Gly - # Met Arg Asn Ile Gln Ile Ala     Leu     #       30205     -      Leu Phe Gln Gly Ile Val Ile Ser - # Val Gly Leu Asp Arg Arg Phe     Cys     #  30405     -      Pro His Asn Arg Gly Val Phe Ile - # Phe Tyr Leu Ser Leu Asp Arg     Asn     # 30550     -      Ala Trp Leu Asn His Ala His Asn - # Arg Asp Phe Gly Met Leu Leu     His     #               30700 - #                3065     -      Leu Phe Ser Ser Ala Arg Cys Val - # Ala Cys His Asn Gln Phe Asn     Leu     #           30850     -      Val Arg Phe Glu Lys Ile Gln Ser - # Leu Ala Lys Leu Leu Asp Arg     Ser     #       31005     -      Gly Val Phe Ser Ala Ile Arg His - # Ser Ser Arg Ile Ala Ile Ile     Asp     #  31205     -      Asp Phe Ile Lys Leu Arg Phe Lys - # Ala Phe Asn Ala Lys Pro Ser     His     # 31350     -      Thr Ala Ile Lys Asn Ala Tyr Phe - # His Asp Thr Phe Leu Ile Trp     Asp     #               31500 - #                3145     -      Leu Gly Ile Leu Phe Phe Ile His - # Val Lys Phe Phe Ile Val Leu     Ser     #           31650     -      Leu Leu His Phe Arg Ile Asp Lys - # Ala     #        3175     - (2) INFORMATION FOR SEQ ID NO:5:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 10299 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     #ID NO:5: (xi) SEQUENCE DESCRIPTION: SEQ     - AAGCTTTGTC TATTCTAAAA TGCAACAAAC TAAGGACAAT GAAGAATTTT TA - #AACTTAAT       60     - GAATGAAAAA TAAAATCCCT AATTAATCCC ATTAAATTAA AAAAGTGTCA TG - #AAAATAGG      120     - CGTTTTTGAT AGCGGTGTGG GAGGGTTTAG CGTTTTAAAA AGCCTTTTAA AA - #GCGCAATT      180     - ATTTGATGAA ATCATCTATT ATGGCGATAG CGCTAGAGTG CCTTATGGCA CT - #AAAGACCC      240     - CACTACGATC AAGCAATTTG GCTTAGAGGC TTTGGATTTT TTCAAACCGC AC - #CAGATTGA      300     - ATTATTGATT GTGGCATGCA ACACAGCGAG CGCTCTAGCT TTAGAAGAGA TG - #CAAAAGCA      360     - TTCCAAAATC CCTATTGTGG GCGTGATTGA GCCAAGCATT TTAGCGATCA AG - #CGACAAGT      420     - AAAAGATAAA AACGCCCCTA TTTTAGTGCT AGGGACAAAA GCGACGATCC AA - #TCCAACGC      480     - TTATGACAAT GCCCTGAAAC AACAAGGCTA TTTGAATGTT TCGCATTTAG CC - #ACTTCTCT      540     - TTTTGTGCCT TTGATTGAAG AAAGTATTTT AGAGGGCGAA TTGTTAGAAA CT - #TGCATGCG      600     - TTATTATTTC ACTCCCTTAA AGATTTTCCC GAAGTGATTA TTTTAGGTTG CA - #CGCATTTT      660     - CCCTTAATCG CTCAAAAAAT TGAGGGCTAT TTTATGGAGC ATTTTGCCCT TT - #CAACACCC      720     - CCCCTACTCA TCCATTCGGG CGATGCTATT GTAGGATATT TGCAGCAAAA AT - #ACGCCCTT      780     - AAAAAAAATG CACACGCATT CCCTAAAGTG GAATTTCATG CGAGCGGCGA TG - #TGATCTGG      840     - CTAGAAAAAC AAGCTAAAGA ATGGCTCAAA TTGTAACGCT AATAAAAATT TA - #AAGTCAAA      900     - ATAACTCAAT TCCCAAAAAG GGGCTATAGC TCACACTATA AGGGTATTGT GT - #TGATTATA      960     - AAGCGCTTTA GAGAGATTTA TGAAAGCGTT TTGAAGCGAA GAATGTCTAA TG - #ATTATCTT     1020     - TGGAAAAGAT TACCTATCTA CAGACTTGCA AAATAGCGCT AAAGATATTC TT - #CTCATCGC     1080     - TTCGTAAATT CTCAAAGAAA GACTTTTTAC CCACAAAAAT GAGATTTTCT TT - #TGCCCTAG     1140     - AAATAGCCAC ATTCAAGCGT TTAGAATCTA GCAAGAAAGA AAGATTACCA CA - #AGTTTTCA     1200     - CGGTGGAATA AATAATAATA TCTGCCTCTT CACCTTGAAA GGCATCCACA GT - #GTCTATTT     1260     - TGAGCTCATC AAAATTCTTG AAGCCGTATT TTTCCACTTC TGATCGCAAG CG - #TCTTTTTT     1320     - GGGCATTATA AGGTGTGATA ATTCCTATGG TTTTTCTGAT TTTTCTTTGG TT - #AAGAGCGC     1380     - GATCAATTTG CTCTAAAAGC TCTATGATTT TTTGAACTTG ATCTTTGTTA TA - #GCTACTTG     1440     - TTTTTTCTAG TTTATGCTCT CTTTATAACC CTTAATTAAT TTTCGCACTA TA - #GCACGAAC     1500     - CCACTAGCAC AATCAAACTT AATTTTAAGC AAAGGGTTAA GTCTTTTTGG GG - #TAGGAAAT     1560     - ATCCATGCAG ACTTTAAAGA ACAAAGCCTT TCGTTAGTGT CTGTCGGGAT AT - #TTTAAGGT     1620     - GAAGTGCCCA ACCGCTAAAG ACGATTGGGC TTCCTAGGCT GATGTTGCCC GT - #AGGGAGAG     1680     - TTTGGTTCTC ACTCTGTGTC CCTATAATAG GGATTTTACA GAGTTTGAGC TC - #CAACGCAT     1740     - TCCCTACAGG TCTCACACAT CATATCTCCA AAGGCGTAAC TTTGCACACT TC - #CTACGCTA     1800     - GATACGAATG TATGGGGATA TGATACCATG AATTGGTAGC ATTCATCCCA AA - #TACTAAAG     1860     - ACGATTGGGA TTTCTGCTAA GGAGTATTAA AAAACCCCTC GCTTTTAGCA AG - #GGGTGGTT     1920     - TTTGCGTTTT TGGTATTCCT TAATCCTTTA AGATTTTTGG AAACCACCTT TT - #GTATTAAC     1980     - GTTCTTGATT CCATGCTCCG CATTTTCTCT CGCCAAGCAG TAATAAGATG CT - #GTAGAAAA     2040     - TGCATTGGTT AAAAATTGCG TAAAGCCAGA ATTAGTGTCT TTTACAGCAT TG - #TTCAACTT     2100     - GGTGGAAAAC TTGAACGAAT CAGAATAATC TTTCATATTC TTCTGGTTGA AG - #CCAATTTT     2160     - ATCATACTCT GACAAAGGAA CGCTTCCTAC ATTATGCGCA ACTATCTTAT CA - #TTCACGAG     2220     - CTTGAGCCAC TCAGGGTTTT TTTGCGTTAG CATGCCGGTC GCTTTTTCAT TG - #ATTGCTCC     2280     - ATTTTTGATA TTGCTATTAA TGCGTATGTG GCTGTTAGTA GCGTAATTGT CT - #AGTTTCGC     2340     - TGACAAACTA GCAGGTACTT TTTTTGCACT TTCAACCCAT AGATTCATGG GA - #TTGTGTTT     2400     - TGTAGAATCT TTGAGCTTGT CTATCGTTTG CTCTAGATTG CCAAAAAAAC CT - #GCTTTAGC     2460     - TTCTGATACC GCTTGATTGA GATTGTCAAT TTTCTGAGCC AATTCTTGAT TC - #CTTGAAAG     2520     - CCCTACCTTA CTGAGATCAT CAACTTTATC ATGCCTTTTC AAAGGGAAGC CC - #GCTGCTTG     2580     - CCCTACAACA CCCAAACCAC TTGCTATTTG ATTGAGTCGG TCAATTTTTG CA - #TTTACCTT     2640     - TTTAGCAACT TGAGCGTAAA TGGGTTCTTC AAGGCTAGCT GCTTGCCCTG CT - #TTCTTTTT     2700     - ATTAACTTTA GCATAAATGG GTTCGTTTTT GAGTCCATTA TTGTTATTGT TA - #TTGAAATT     2760     - TCCAAGTTTT GCATTCAACT CTTTCTTGAT GTCCGAAAAG TTTTTAGAAA GA - #GTTGTGGC     2820     - TTCTGCTTGA GATAACCCAT TACCGACTAG GGTTCCATTC ACACCATTCT TA - #ACGGATTG     2880     - ATATATTTCA GATTTTTTTC TAGCATTGAG ACTTTCATTT TTTTGAGCTT GT - #TGGGCCAA     2940     - TTGCTCCTTT GAGAAATTTT TGAGATCGGC TAACGCTTGC TCTACCCTAC TG - #AAATCACC     3000     - CGTTGCTTTA GCCACTGATA CCGCTTGATT GAGATTATCA ACTTTATCCG TT - #ACCTTTTG     3060     - ATTGATGATC ACATCTTTAA CGGAATTTTC AAGGTCGCTT TTTGCTTGCG TT - #ACCTTGCT     3120     - GAAATCCTTA TTTTTGCCAT TTTTGAATTC ATTCAAAGCT GCATTAAGGT TT - #TCAACTTT     3180     - TGAAATCCAT TCTGGATTGA TACCTAAATC TTTCACCGAA CCTTTAAGGG CT - #TTTAGTGT     3240     - TTCTTCTGCC TTGCTGAAAT CCTTATTTTT GCCATTTTTG AATTCATCAA AA - #GATTTATC     3300     - AAAGTCTTTC AAATTCTTGT TGACATTTTC AAGTTTATCA GACAATTCCC TT - #TTGATGCC     3360     - TTTAAGATTC TGAGCGTAAG CGATTGCTCT TGCGTCTCTA TTAGCCTCTT TA - #TTGATCAA     3420     - CGCAAAAATC TCATCTTTTT GGCTGTTAGC TTGAGCTTTT GCTTCCATTT TA - #TTTTTGTT     3480     - GCCGCTTTTG CTCTCCAATT TTTTCTCTAC TTCTTTCTCT AAATGCTCTC GT - #TTCCTTAG     3540     - AGATTTTTCA AGATCTTTCT GAGCTTTTTT CACTTCATCA TAATTGCCTG TG - #TTTTTAGC     3600     - GTCAGCTACA GCTTTATTGA AGTTTAAAGT TTTTCCAACC AATTCTTTGT TG - #CTGCTCAA     3660     - AAAATCTTTG ATAAGCTTAT TAGCTTCTTG TGGGGACAAT CCTTTAGTGG TT - #AGTTTATC     3720     - CTCTAAATTC CGCCTTACGA AACTAGTGAT AGCGAGATTA TTTAAATCAG GC - #AAATTAAA     3780     - GATAGCTACC TTGTTAAAGC CTACTTCTAA ATGGGAAACG CCATTCGTAA CG - #CCTACACC     3840     - CTTATTGGGA TTCTTGGAGG CGTTGGTGTA TTTGAAATTA GAATAATCAA CA - #AACATCAC     3900     - GCCATCATGT TTTAGGCTAC CTTGAAGAGT AACATTTTTC TCCCTATCTA AA - #GCTTTATC     3960     - TGCTTTTTTC CCATAATCTT TGAGAGTGTA GCTCAAATCC CCATTACCAA AC - #TCAGTAAT     4020     - TAAAGCTGAA TGTTTTGTGT CTTTTTTAGA AACAAAAGCA ATACGATCAT TC - #CCTAGGGC     4080     - GTCTAAATAA GCCTTAGAGT CTTTTTGGAA ATCTTTAATC TCAGTTCGGA AT - #TTTTCCTT     4140     - CTCTTTCTCG CTCAAGTTGT CTAATTTAGC ATTATTTTGT GCAAGAAATT CC - #ATGAAATC     4200     - TATTTTGTTT TGGATCTCTT CTTGACTTAA TGCTCGTTGT GAGCCTGTGA GT - #TGGTCTTC     4260     - TTTGTAGAGA TAAAAACTAG GGTTGTTAAT CCCTTTCTCA CCACCTGCTA TG - #ACTAAGCC     4320     - ACTGCCGTTT TTCATATGCA CATTAATTAT TGTAGCCACA TTGTTGCCTT GT - #TGGTCTTT     4380     - ATAACCAACG GTGGCGTTCC AATCATGCCT AGCTCCAGGA CCACCATTGC CC - #CCATACAA     4440     - CAATGAAACT TTTTCAGGTT CTATGCCATT ATGACTCCCC ATTAACACAG AA - #GACAGAGC     4500     - GTTATTGTGA ATCAATAATT GATTGAACTT GTAATTGGGA TCAATGTCAG CG - #ACTCCCTC     4560     - AACATCTAAC ATTTCCATAT CGCCAAGAGT GAATTTAGAA AAATTACCCC TT - #TCATCAAG     4620     - TAAATCTCTA GCTTCAGGCG GTAAGCCTTG TATGTCGGTG GTGGTAGTGG CT - #ATATCTGG     4680     - TTGGACATGG GGAACTGGTT CTTGATTGAT TGCTTCTTTG ACATCAGAAG AT - #TGTTTTTT     4740     - GTCAAATATA AATGAGAGAA AAATATCCAA CCAATCCCCA CCAGTAGGCT CT - #CCATTTTT     4800     - TTCTGCTTCT TGCCTTTCTT TCAAGGACTC ATCAAACACG CCCATGAACT TT - #TGATCCGT     4860     - TCGGATTTGA TTCCCTATAA TGATTCCTGC AAAAGATTGT TTGGCAGATT TC - #AAAAACTC     4920     - CGCTTTCTCT TTATCATCAA GGATAGGGGG TTGTATGATA TTTTCCATAA AA - #TTTCGGAT     4980     - CGATCGGGTG TTGATTTTAG ACGGATCGTT TTGATGGGAC ACCCAACTTG TG - #AAAATTCG     5040     - GTAACGCTGA TCCCCAAATT TCTGAAAGCT CTTTGTGGAA GATTCTACAT CA - #ATGAGATT     5100     - GTCTTTGTTG ATTAAATCAT TGCTCTTATT GATAAAGTCT GAAAAATACT GA - #TTCTTTTT     5160     - GGTAGGATTT TTGATCGCTT TATTGGAGTA TTCTTCCCTT AATTGCGAGA TT - #CCTTCAAA     5220     - AGCTTGCCTG TTATCCCTAT CGTTCTTATC AACGATTGGT TTTTGATCAG GA - #TCGTATGA     5280     - AGCGACAGCG TTATCAACTT TAAGAAAAGC TACTTGAAGA TTATTGATAA AT - #TGCTGCGG     5340     - GTTAAAAGCC GCTTCGGTTT GTGGTTGTTG GTCAATGGTT TCGTTAGTCA TT - #GTTTCTCC     5400     - TTACTATACC TAGTTTCATA CCTATCGGTA TCTTGTTAGT GTCAAAGACT GC - #TAAAAATC     5460     - GCTACACGCT TGAGTAGTCA ATTCATGTCT TTGAACATTC TCATTATACA AC - #AATATCAA     5520     - GCATTTTTTT TTTTTTTCAT TAATCATTAT TAATATTGAA AAAAGCCCGA TC - #GTTGCAGG     5580     - CGTCTAAAGG CTAGGACGAT TCACATTTTT ACCCACAATT AACAAAAATT TG - #CTAAAATG     5640     - AAAGAACTTT AATAAGAACA AGAAATAAGA AATGAATCTT TTTGAAGAAA TC - #AAATGCAA     5700     - CGCTAAAGTA TGGTTGGAGA AATGGTTAGT CAAGGTGAGC ACATGAAGCG TT - #CTAAAAAT     5760     - AGTTTGGTTT GTGTGTCATG CTTACTATAG CATGTATAAT TAAGCGTTTG TG - #ATCCACAA     5820     - GATCCACATG GGAAACAAAA TGGAAAACAA ATCAATAGGA CAGATTTTCA AA - #GATAGCCT     5880     - CAAAAAAAGT TTCTTTAGTG GTCTATGGAG TTGCTTAAAA TGGAGCTTTA TT - #CTCACTCT     5940     - GATCAGCTTG GGTTTGTTTT TGCTTGTTTT TAGGTTTCAA CCTGAGACGA TT - #AAAAAATA     6000     - CATCAAAGAT CCTAAAGATC TACAATTCTA CAACGACTTG AGAAAGAAAA AT - #GGTTGGGA     6060     - CAAGTAGGTT TACTATTCCT GAATGATGTC AGTTGCCGAG CAAAAGCGTC GG - #TTTTGATT     6120     - TAAATTATAA AGTAGCGCTT GTATGAATAG TATAAAAATA CTTTTTTTTT TG - #ATATACTC     6180     - AAGCGATTGA TTTCAATTTG AAAGGAAACG CATGAAATTT TTTACAAGAA TC - #ACTGACAG     6240     - CTACAAGAAA GTTGTAGTAA CTTTAGGGCT AGTGGTAACA ACCAATCCTT TA - #ATGGCGGT     6300     - CGCCAGTCCT ACAGAAGGCG TCACTGCAAC TAAAGGTTTG GTTATTCAGA TC - #ATTTCTGT     6360     - TCTAGCGATC GTAGGTGGTT GCGCTTTAGG GGTCAAAGGC ATAGCGGATA TT - #TGGAAAAT     6420     - CTCTGATGAC ATTAAAAGAG GTCAGGCGAC TGTTTTTGCT TACGCGCAAC CC - #ATAGCTAT     6480     - GTTAGCGGTG GCAGGCGGTA TTATCTATTT GAGCACTAAG TTTGGCTTCA AT - #ATTGGCGA     6540     - GGGTGGAGGA GCTAGCTAAG TTGATCAACA ATAATAATAA TAGCAATAAG AA - #ACTAAGAG     6600     - GCTTTTTTTT GAAAGTTCTC TTAAGTCTCG TTGTTTTCAG TTCGTATGGG TT - #AGCAAATG     6660     - ATGACAAAGA AGCCAAAAAA GAAGCACAAG AAAAAGAAAA AAACACTCCC AA - #TGGGCTTG     6720     - TTTATACAAA TTTAGATTTT GATAGTTTCA AAGCGACTAT CAAAAATTTG AA - #AGACAAGA     6780     - AAGTAACTTT CAAAGAAGTC AATCCCGATA TTATCAAAGA TGAAGTTTTT GA - #CTTCGTGA     6840     - TTGTCAATAG AGTCCTTAAA AAAATAAAGG ATTTGAAGCA TTACGATCCA GT - #TATTGAAA     6900     - AAATCTTTGA TGAAAAGGGT AAGGAAATGG GATTGAATGT AGAATTACAG AT - #CAATCCTG     6960     - AAGTGAAAGA CTTTTTTACT TTCAAAAGCA TCAGCACGAC CAACAAACAA CG - #CTGCTTTC     7020     - TGTCATTGCG CGGAGAAACA AGAGAAATTC TATGCGATGA TAAGTTATAT AA - #TGTTTTAT     7080     - TGGCCGTATT CAATTCTTAT GATCCTAATG ATCTTTTGAA ACATATTAGC AC - #CGTAGAGT     7140     - CTCTCAAAAA AATCTTTTAT ACGATTACAT GTGAAGCGGT ATATCTATAA AG - #AGAGAGGT     7200     - GTTTGTGGCA AGCAAGCAGG CTGATGAACA AAAAAAGCTA ATTATAGAGC AA - #GAGGTTCA     7260     - AAAGCGGCAG TTTCAAAAAA TAGAAGAACT TAAAGCAGAC ATGCAAAAGG GT - #GTCAATCC     7320     - CTTTTTTAAA GTCTTGTTTG ATGGGGGGAA TAGGTTGTTT GGTTTCCCTG AA - #ACTTTTAT     7380     - TTATTCCTCT ATATTTATAT TGTTTGTAAC CATTGTATTA TCTGTTATTC TT - #TTTCAAGC     7440     - CTATGAACCT GTTTTGATTG TAGCGATTGT TATTGTGCTT GTAGCTCTTG GA - #TTCAAGAA     7500     - AGATTATAGG CTTTATCAAA GAATGGAGCG AGCGATGAAA TTTAAAAAAC CT - #TTTTTGTT     7560     - TAAGGGCGTG AAAAACAAAG CGTTCATGAG CATTTTTTCC ATGAAGCCTA GT - #AAAGAAAT     7620     - GGCTAATGAC ATCCACTTAA ATCCAAACAG AGAAGACAGA CTTGTGAGCG CT - #GCAAACTC     7680     - CTATCTAGCA AATAACTATG AATGTTTTTT AGATGATGGG GTGATCCTTA CT - #AACAACTA     7740     - TTCTCTTTTA GGCACAATCA AATTGGGGGG CATTGATTTT TTAACCACTT CC - #AAAAAAGA     7800     - TCTCATAGAG TTACACGCTT CTATTTATAG CGTTTTTAGG AATTTTGTTA CC - #CCTGAATT     7860     - CAAATTCTAT TTTCACACTA TTAAAAAGAA AATCGTTATT GATGAAACCA AT - #AGGGACTA     7920     - TGGTCTTATT TTTTCTAATG ATTTCATGCG AGCCTATAAT GAGAAGCAAA AG - #AGAGAAAG     7980     - TTTTTATGAT ATTAGTTTTT TTCTGACCAT AGAGCAAGAT TTATTAGACA CT - #CTCAATGA     8040     - ACCCGTTATG AATAAAAAGC ATTTTGCAGA CAATAATTTT GAAGAGTTTC AA - #AGGATTAT     8100     - TAGAGCCAAG CTTGAAAACT TCAAGGATAG GATAGAGCTC ATAGAAGAGC TG - #TTGAGCAA     8160     - ATACCACCCC ACTAGATTAA AAGAATACAC TAAAGATGGC GTTATTTACT CC - #AAACAATG     8220     - CGAATTTTAC AATTTTCTTG TGGGAATGAA TGAAGCCCCT TTTATTTGCA AC - #CGAAAAGA     8280     - CTTGTATCTC AAGGAAAAAA TGCATGGTGG GGTGAAAGAA GTTTATTTTG CC - #AATAAGCA     8340     - TGGAAAAATC TTAAATGACG ATTTGAGTGA AAAATATTTT AGCGCTATTG AG - #ATTAGTGA     8400     - ATACGCCCCT AAATCACAAA GCGATTTGTT TGATAAAATC AACGCCCTAG AC - #AGCGAATT     8460     - TATTTTCATG CATGCTTATT CGCCTAAAAA CTCACAGGTT TTAAAGGACA AA - #CTGGCTTT     8520     - CACCTCTAGA AGAATTATTA TTAGTGGAGG CTCTAAAGAA CAGGGCATGA CT - #TTAGGTTG     8580     - CTTGAGCGAA TTAGTGGGTA ATGGTGATAT TACGCTAGGC AGTTATGGTA AT - #TCTTTAGT     8640     - GCTGTTTGCT GATAGCTTTG AAAAAATGAA ACAAAGCGTT AAGGAATGCG TC - #TCTAGTCT     8700     - TAACGCTAAA GGTTTTTTAG CCAACGCAGC GACTTTCTCT ATGGAAAATT AC - #TTTTTTGC     8760     - CAAACATTGC TCTTTTATCA CGCTTCCTTT TATTTTTGAT GTAACTTCTA AT - #AATTTTGC     8820     - TGATTTCATC GCTATGAGGG CTATGAGTTT TGATGGCAAT CAAGAGAATA AC - #GCTTGGGG     8880     - CAATAGTGTG ATGACGCTAA AAAGCGAGAT CAATTCGCCT TTTTATCTGA AC - #TTCCACAT     8940     - GCCCACTGAT TTTGGTTCAG CTTCAGCAGG ACACACTTTG ATACTTGGCT CA - #ACCGGTTC     9000     - AGGTAAGACA GTGTTTATGT CAATGACCTT GAACGCTATG GGACAATTTG CC - #TATAATTT     9060     - TCCTGCTAAT GTCAGCAAAG ACAAGCAAAA GCTCACTATG GTTTATATGG AT - #AAAGATTA     9120     - TGGCGCTTAT GGGAATATTG TCGCAATGGG TGGGGAGTAT GTCAAGATTG AG - #CTAGGGAC     9180     - AGATACAGGA TTAAATCCTT TTGCTTGGGC GGCTTGTGTG CAAAAAACAA AT - #GCAACAAT     9240     - GGAGCAAAAA CAAACAGCTA TTTCTGTTGT CAAAGAGCTT GTGAAAAACT TA - #GCAACTAA     9300     - AAGCGATGAA AAAGATGAAA ATGGCAACAG CATCTCTTTT AGCCTAGCAG AT - #TCTAATAC     9360     - GCTTGCAGCG CAGTAACCAA CCTTATCACA GGAAATATGA ATCTAGATTA TC - #CCATCACT     9420     - CAACTTATTA ATGCTTTCGG GAAAGACCAC AATGATCCTA ATGGGCTTGT CG - #CGCGATTA     9480     - GCGCCTTTTT GCAAATCAAC CAATGGTGAA TTTCAATGGC TTTTTGATAA TA - #AAGCAACA     9540     - GATCGCTTAG ATTTTTCAAA AACGATTATT GGCGTTGATG GGTCAAGTTT CT - #TAGACAAT     9600     - AATGATGTTT CGCCCTTTAT TTGTTTTTAC CTTTTCGCTC GTATCCAAGA GG - #CAATGGAT     9660     - GGGCGTAGAT TTGTCTTAGA TATTGATGAA GCCTGGAAAT ATTTAGCGAT CC - #AAAGGTCG     9720     - CTTATTTTGT AAGAGACATG CTAAAAACTG CAAGGAAAAG AAACGCTATT GT - #CAGACTTG     9780     - CGACTCAAAG CATCACTGAT CTTTTGGCTT GCCCTATTGC TGATACGATT AG - #AGAACAAT     9840     - GCCCTACAAA GATTTTTTTG AGAAACGATG GGGGCAATCT TTCTGATTAC CA - #AAGATTGG     9900     - CTAATGTTAC AGAAAAAGAA TTTGAAATCA TCACTAAGGG ACTAGATAGG AA - #AATCCTCT     9960     - ACAAACAGGA TGGAAGCCCT AGCGTTATCG CTAGTTTTAA TTTGAGAGGC AT - #TCCTAAAG     10020     - AATATTTGAA AATTTTATCC ACAGATACTG TATTTGTCAA AGAAATTGAT AA - #GATTATCC     10080     - AAAACCATAG TATCATAGAT AAATATCAGG CCTTGAGACA AATGTATCAA CA - #AATAGAGG     10140     - AGTATTAAAA TGAAACAAAA TTTGCGTGAA CAAAAATTAT GGAAATTTTA GA - #AAATGATG     10200     - TCTTGACGAT TTTGGATAGT TTTTCTAATT ATCTTTTTGA ACTGAAAGAA GA - #ATTGGACT     10260     # 10299            GGAA GGCGAAATCA CTGAACAAA     - (2) INFORMATION FOR SEQ ID NO:6:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 3169 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:6: (xi) SEQUENCE DESCRIPTION: SEQ     -      Ala Leu Ser Ile Leu Lys Cys Asn - # Lys Leu Arg Thr Met Lys Asn     Phe     #   15     -      Thr Met Lys Asn Lys Ile Pro Asn - # Ser His Ile Lys Lys Val Ser     Lys     #                 30     -      Ala Phe Leu Ile Ala Val Trp Glu - # Gly Leu Ala Phe Lys Ala Phe     Lys     #             45     -      Arg Asn Tyr Leu Met Lys Ser Ser - # Ile Met Ala Ile Ala Leu Glu     Cys     #         60     -      Leu Met Ala Leu Lys Thr Pro Leu - # Arg Ser Ser Asn Leu Ala Arg     Leu     #     80     -      Trp Ile Phe Ser Asn Arg Thr Arg - # Leu Asn Tyr Leu Trp His Ala     Thr     #   95     -      Gln Arg Ala Leu Leu Lys Arg Cys - # Lys Ser Ile Pro Lys Ser Leu     Leu     #                110     -      Trp Ala Leu Ser Gln Ala Phe Arg - # Ser Ser Asp Lys Lys Ile Lys     Thr     #            125     -      Pro Leu Phe Cys Gly Gln Lys Arg - # Arg Ser Asn Pro Thr Leu Met     Thr     #        140     -      Met Pro Asn Asn Lys Ala Ile Met - # Phe Arg Ile Pro Leu Leu Phe     Leu     #    160     -      Cys Leu Leu Lys Lys Val Phe Arg - # Ala Asn Cys Lys Leu Ala Cys     Val     #   175     -      Ile Ile Ser Leu Pro Arg Phe Ser - # Arg Ser Asp Tyr Phe Arg Leu     His     #                190     -      Ala Phe Ser Leu Asn Arg Ser Lys - # Asn Gly Leu Phe Tyr Gly Ala     Phe     #            205     -      Cys Pro Phe Asn Thr Pro Pro Thr - # His Pro Phe Gly Arg Cys Tyr     Cys     #        220     -      Arg Ile Phe Ala Ala Lys Ile Arg - # Pro Lys Lys Cys Thr Arg Ile     Pro     #    240     -      Ser Gly Ile Ser Cys Glu Arg Arg - # Cys Asp Leu Ala Arg Lys Thr     Ser     #   255     -      Arg Met Ala Gln Ile Val Thr Leu - # Ile Lys Ile Ser Gln Asn Asn     Ser     #                270     -      Ile Pro Lys Lys Gly Leu Leu Thr - # Leu Gly Tyr Cys Val Asp Tyr     Lys     #            285     -      Ala Leu Arg Asp Leu Lys Arg Phe - # Glu Ala Lys Asn Val Leu Ser     Leu     #        300     -      Glu Lys Ile Thr Tyr Leu Gln Thr - # Cys Lys Ile Ala Leu Lys Ile     Phe     #    320     -      Phe Ser Ser Leu Arg Lys Phe Ser - # Lys Lys Asp Phe Leu Pro Thr     Lys     #   335     -      Met Arg Phe Ser Phe Ala Leu Glu - # Ile Ala Thr Phe Lys Arg Leu     Glu     #                350     -      Ser Ser Lys Lys Glu Arg Leu Pro - # Gln Val Phe Thr Val Glu Ile     Ile     #            365     -      Ile Ser Ala Ser Ser Pro Lys Ala - # Ser Thr Val Ser Ile Leu Ser     Ser     #        380     -      Ser Lys Phe Leu Lys Pro Tyr Phe - # Ser Thr Ser Asp Arg Lys Arg     Leu     #    400     -      Phe Trp Ala Leu Gly Val Ile Ile - # Pro Met Val Phe Leu Ile Phe     Leu     #   415     -      Trp Leu Arg Ala Arg Ser Ile Cys - # Ser Lys Ser Ser Met Ile Phe     Thr     #                430     -      Ser Leu Leu Leu Leu Val Phe Ser - # Ser Leu Cys Ser Leu Tyr Asn     Pro     #            445     -      Leu Ile Phe Ala Leu His Glu Pro - # Thr Ser Thr Ile Lys Leu Asn     Phe     #        460     -      Lys Gln Arg Val Lys Ser Phe Trp - # Gly Arg Lys Tyr Pro Cys Arg     Leu     #    480     -      Arg Thr Lys Pro Phe Val Ser Val - # Cys Arg Asp Ile Leu Arg Ser     Ala     #   495     -      Gln Pro Leu Lys Thr Ile Gly Leu - # Pro Arg Leu Met Leu Pro Val     Gly     #                510     -      Arg Val Trp Phe Ser Leu Cys Val - # Pro Ile Ile Gly Ile Leu Gln     Ser     #            525     -      Leu Ser Ser Asn Ala Phe Pro Thr - # Gly Leu Thr His His Ile Ser     Lys     #        540     -      Gly Val Thr Leu His Thr Ser Tyr - # Ala Arg Tyr Glu Cys Met Gly     Ile     #    560     -      Tyr His Glu Leu Val Ala Phe Ile - # Pro Asn Thr Lys Asp Asp Trp     Asp     #   575     -      Phe Cys Gly Val Leu Lys Asn Pro - # Ser Leu Leu Ala Arg Gly Gly     Phe     #                590     -      Cys Val Phe Gly Ile Pro Ser Phe - # Lys Ile Phe Gly Asn His Leu     Leu     #            605     -      Tyr Arg Ser Phe His Ala Pro His - # Phe Leu Ser Pro Ser Ser Asn     Lys     #        620     -      Met Leu Lys Met His Trp Leu Lys - # Ile Ala Ser Gln Asn Cys Leu     Leu     #    640     -      Gln His Cys Ser Thr Trp Trp Lys - # Thr Thr Asn Gln Asn Asn Leu     Ser     #   655     -      Tyr Ser Ser Gly Ser Gln Phe Tyr - # His Thr Leu Thr Lys Glu Arg     Phe     #                670     -      Leu His Tyr Ala Gln Leu Ser Tyr - # His Ser Arg Ala Ala Thr Gln     Gly     #            685     -      Phe Phe Ala Leu Ala Cys Arg Ser - # Leu Phe His Leu Leu His Phe     Tyr     #        700     -      Cys Tyr Cys Val Cys Gly Cys Arg - # Asn Cys Leu Val Ser Leu Thr     Asn     #    720     -      Gln Val Leu Phe Leu His Phe Gln - # Pro Ile Asp Ser Trp Asp Cys     Val     #   735     -      Leu Asn Leu Ala Cys Leu Ser Phe - # Ala Leu Asp Cys Gln Lys Asn     Leu     #                750     -      Leu Leu Leu Ile Pro Leu Asp Asp - # Cys Gln Phe Ser Glu Pro Ile     Leu     #            765     -      Asp Ser Leu Lys Ala Leu Pro Tyr - # Asp His Gln Leu Tyr His Ala     Phe     #        780     -      Ser Lys Gly Ser Pro Leu Leu Ala - # Leu Gln His Pro Asn His Leu     Leu     #    800     -      Phe Asp Val Gly Gln Phe Leu His - # Leu Pro Phe Gln Leu Glu Arg     Lys     #   815     -      Trp Val Leu Gln Gly Leu Leu Ala - # Leu Leu Ser Phe Tyr Leu His     Lys     #                830     -      Trp Val Arg Phe Val His Tyr Cys - # Tyr Cys Tyr Asn Phe Gln Val     Leu     #            845     -      His Ser Thr Leu Ser Cys Pro Lys - # Ser Phe Lys Glu Leu Trp Leu     Leu     #        860     -      Leu Glu Ile Thr His Tyr Arg Leu - # Gly Phe His Ser His His Ser     Arg     #    880     -      Ile Asp Ile Phe Gln Ile Phe Phe - # His Asp Phe His Phe Phe Glu     Leu     #   895     -      Val Gly Pro Ile Ala Pro Leu Arg - # Asn Phe Asp Arg Leu Thr Leu     Ala     #                910     -      Leu Pro Tyr Asn His Pro Leu Leu - # Pro Leu Ile Pro Leu Asp Asp     Tyr     #            925     -      Gln Leu Tyr Pro Leu Pro Phe Asp - # Ser His Leu Arg Asn Phe Gln     Gly     #        940     -      Arg Phe Leu Leu Ala Leu Pro Cys - # Asn Pro Tyr Phe Cys His Phe     Ile     #    960     -      His Ser Lys Leu His Gly Phe Gln - # Leu Leu Lys Ser Ile Leu Asp     Tyr     #   975     -      Leu Asn Leu Ser Pro Asn Leu Gly - # Leu Leu Val Phe Leu Leu Pro     Cys     #                990     -      Asn Pro Tyr Phe Cys His Phe Ile - # His Gln Lys Ile Tyr Gln Ser     Leu     #           10050     -      Ser Asn Ser Cys His Phe Gln Val - # Tyr Gln Thr Ile Pro Phe Cys     Leu     #       10205     -      Asp Ser Glu Arg Lys Arg Leu Leu - # Leu Arg Leu Tyr Pro Leu Tyr     Ser     #  10405     -      Thr Gln Lys Ser His Leu Phe Gly - # Cys Leu Glu Leu Leu Leu Pro     Phe     # 10550     -      Tyr Phe Cys Cys Arg Phe Cys Ser - # Pro Ile Phe Ser Leu Leu Leu     Ser     #               10700 - #                1065     -      Leu Asn Ala Leu Val Ser Leu Glu - # Ile Phe Gln Asp Leu Ser Glu     Leu     #           10850     -      Phe Ser Leu His His Asn Cys Leu - # Cys Phe Arg Gln Leu Gln Leu     Tyr     #       11005     -      Ser Leu Lys Phe Phe Gln Pro Ile - # Leu Cys Cys Cys Ser Lys Asn     Leu     #  11205     -      Ala Tyr Leu Leu Val Gly Thr Ile - # Leu Trp Leu Val Tyr Pro Leu     Asn     # 11350     -      Ser Ala Leu Arg Asn Arg Asp Tyr - # Leu Asn Gln Ala Asn Arg Leu     Pro     #               11500 - #                1145     -      Cys Ser Leu Leu Leu Asn Gly Lys - # Arg His Ser Arg Leu His Pro     Tyr     #           11650     -      Trp Asp Ser Trp Arg Arg Trp Cys - # Ile Asn Asn Asn Gln Gln Thr     Ser     #       11805     -      Arg His His Val Leu Gly Tyr Leu - # Glu Glu His Phe Ser Pro Tyr     Leu     #  12005     -      Lys Leu Tyr Leu Leu Phe Ser His - # Asn Leu Glu Cys Ser Ser Asn     Pro     # 12150     -      His Tyr Gln Thr Gln Leu Lys Leu - # Asn Val Leu Cys Leu Phe Lys     Gln     #               12300 - #                1225     -      Lys Gln Tyr Asp His Ser Leu Gly - # Arg Leu Asn Lys Pro Ser Leu     Phe     #           12450     -      Gly Asn Leu Ser Gln Phe Gly Ile - # Phe Pro Ser Leu Ser Arg Ser     Ser     #       12605     -      Cys Leu Ile His Tyr Phe Val Gln - # Glu Ile Pro Asn Leu Phe Cys     Phe     #  12805     -      Gly Ser Leu Leu Asp Leu Met Leu - # Val Val Ser Leu Val Gly Leu     Leu     # 12950     -      Cys Arg Asp Lys Asn Gly Cys Ser - # Leu Ser His His Leu Leu Leu     Ser     #               13100 - #                1305     -      His Cys Arg Phe Ser Tyr Ala His - # Leu Leu Pro His Cys Cys Leu     Val     #           13250     -      Gly Leu Tyr Asn Gln Arg Trp Arg - # Ser Asn His Ala Leu Gln Asp     His     #       13405     -      His Cys Pro His Thr Thr Met Lys - # Leu Phe Gln Val Leu Cys His     Tyr     #  13605     -      Asp Ser Pro Leu Thr Gln Lys Thr - # Glu Arg Tyr Cys Glu Ser Ile     Ile     # 13750     -      Asp Thr Cys Asn Trp Asp Gln Cys - # Gln Arg Leu Pro Gln His Leu     Thr     #               13900 - #                1385     -      Phe Pro Tyr Arg Gln Glu Ile Lys - # Asn Tyr Pro Phe His Gln Val     Asn     #           14050     -      Leu Leu Gln Ala Val Ser Leu Val - # Cys Arg Trp Trp Trp Leu Tyr     Leu     #       14205     -      Val Gly His Gly Glu Leu Val Leu - # Asp Leu Leu Leu His Gln Lys     Ile     #  14405     -      Val Phe Cys Gln Ile Met Arg Glu - # Lys Tyr Pro Thr Asn Pro His     Gln     # 14550     -      Ala Leu His Phe Phe Leu Leu Leu - # Ala Phe Leu Ser Arg Thr His     Gln     #               14700 - #                1465     -      Thr Arg Pro Thr Phe Asp Pro Phe - # Gly Phe Asp Ser Leu Phe Leu     Gln     #           14850     -      Lys Ile Val Trp Gln Ile Ser Lys - # Thr Pro Leu Ser Leu Tyr His     Gln     #       15005     -      Gly Gly Val Val Tyr Phe Pro Asn - # Phe Gly Ser Ile Gly Cys Phe     Thr     #  15205     -      Asp Arg Phe Asp Gly Thr Pro Asn - # Leu Lys Phe Gly Asn Ala Asp     Pro     # 15350     -      Gln Ile Ser Glu Ser Ser Leu Trp - # Lys Ile Leu His Gln Asp Cys     Leu     #               15500 - #                1545     -      Cys Leu Asn His Cys Ser Tyr Ser - # Leu Lys Asn Thr Asp Ser Phe     Trp     #           15650     -      Asp Phe Ser Leu Tyr Trp Ser Ile - # Leu Pro Leu Ile Ala Arg Phe     Leu     #       15805     -      Gln Lys Leu Ala Cys Tyr Pro Tyr - # Arg Ser Tyr Gln Arg Leu Val     Phe     #  16005     -      Asp Gln Asp Arg Met Lys Arg Gln - # Arg Tyr Gln Leu Glu Lys Leu     Leu     # 16150     -      Glu Asp Tyr Ile Ala Ala Gly Lys - # Pro Leu Arg Phe Val Val Val     Gly     #               16300 - #                1625     -      Gln Trp Phe Arg Ser Leu Phe Leu - # Leu Thr Ile Pro Ser Phe Ile     Pro     #           16450     -      Ile Gly Ile Leu Leu Val Ser Lys - # Thr Ala Lys Asn Arg Tyr Thr     Leu     #       16605     -      Glu Ser Ile His Val Phe Glu His - # Ser His Tyr Thr Thr Ile Ser     Ser     #  16805     -      Ile Phe Phe Phe Phe His Ser Leu - # Leu Ile Leu Lys Lys Ala Arg     Ser     # 16950     -      Leu Gln Ala Ser Lys Gly Asp Asp - # Ser His Phe Tyr Pro Gln Leu     Thr     #               17100 - #                1705     -      Lys Ile Cys Asn Glu Arg Thr Leu - # Ile Arg Thr Arg Asn Lys Lys     Ile     #           17250     -      Phe Leu Lys Lys Ser Asn Ala Thr - # Leu Lys Tyr Gly Trp Arg Asn     Gly     #       17405     -      Ser Arg Ala His Glu Ala Phe Lys - # Phe Gly Leu Cys Val Met Leu     Thr     #  17605     -      Ile Ala Cys Ile Ile Lys Arg Leu - # Ser Thr Arg Ser Thr Trp Glu     Thr     # 17750     -      Lys Trp Lys Thr Asn Gln Asp Arg - # Phe Ser Lys Ile Ala Ser Lys     Lys     #               17900 - #                1785     -      Val Ser Leu Val Val Tyr Gly Val - # Ala Asn Gly Ala Leu Phe Ser     Leu     #           18050     -      Ser Ala Trp Val Cys Phe Cys Leu - # Phe Leu Gly Phe Asn Leu Arg     Arg     #       18205     -      Leu Lys Asn Thr Ser Lys Ile Leu - # Lys Ile Tyr Asn Ser Thr Thr     Thr     #  18405     -      Glu Arg Lys Met Val Gly Thr Ser - # Arg Phe Thr Ile Pro Glu Cys     Gln     # 18550     -      Leu Pro Ser Lys Ser Val Gly Phe - # Asp Leu Asn Tyr Lys Val Ala     Leu     #               18700 - #                1865     -      Val Ile Val Lys Tyr Phe Phe Phe - # Tyr Thr Gln Ala Ile Asp Phe     Asn     #           18850     -      Leu Lys Gly Asn Ala Asn Phe Leu - # Gln Glu Ser Leu Thr Ala Thr     Arg     #       19005     -      Lys Leu Leu Gly Trp Gln Pro Ile - # Leu Trp Arg Ser Pro Val Leu     Gln     #  19205     -      Lys Ala Ser Leu Gln Leu Lys Val - # Trp Leu Phe Arg Ser Phe Leu     Phe     # 19350     -      Arg Ser Val Val Ala Leu Gly Ser - # Lys Ala Arg Ile Phe Gly Lys     Ser     #               19500 - #                1945     -      Leu Met Thr Leu Lys Glu Val Arg - # Arg Leu Phe Leu Leu Thr Arg     Asn     #           19650     -      Pro Leu Cys Arg Trp Gln Ala Val - # Leu Ser Ile Ala Leu Ser Leu     Ala     #       19805     -      Ser Ile Leu Ala Arg Val Glu Glu - # Leu Ala Lys Leu Ile Asn Asn     Asn     #  20005     -      Asn Asn Ser Asn Lys Lys Leu Arg - # Gly Phe Phe Leu Lys Val Leu     Leu     # 20150     -      Ser Leu Val Val Phe Ser Ser Tyr - # Gly Leu Ala Asn Asp Asp Lys     Glu     #               20300 - #                2025     -      Ala Lys Lys Glu Ala Gln Glu Lys - # Glu Lys Asn Thr Pro Asn Gly     Leu     #           20450     -      Val Tyr Thr Asn Leu Asp Phe Asp - # Ser Phe Lys Ala Thr Ile Lys     Asn     #       20605     -      Leu Lys Asp Lys Lys Val Thr Phe - # Lys Glu Val Asn Pro Asp Ile     Ile     #  20805     -      Lys Asp Glu Val Phe Asp Phe Val - # Ile Val Asn Arg Val Leu Lys     Lys     # 20950     -      Ile Lys Asp Leu Lys His Tyr Asp - # Pro Val Ile Glu Lys Ile Phe     Asp     #               21100 - #                2105     -      Glu Lys Gly Lys Glu Met Gly Leu - # Asn Val Glu Leu Gln Ile Asn     Pro     #           21250     -      Glu Val Lys Asp Phe Phe Thr Phe - # Lys Ser Ile Ser Thr Thr Asn     Lys     #       21405     -      Gln Arg Cys Phe Leu Ser Leu Arg - # Gly Glu Thr Arg Glu Ile Leu     Cys     #  21605     -      Asp Asp Lys Leu Tyr Asn Val Leu - # Leu Ala Val Phe Asn Ser Tyr     Asp     # 21750     -      Pro Asn Asp Leu Leu Lys His Ile - # Ser Thr Val Glu Ser Leu Lys     Lys     #               21900 - #                2185     -      Ile Phe Tyr Thr Ile Thr Cys Glu - # Ala Val Tyr Leu Arg Glu Arg     Cys     #           22050     -      Leu Trp Gln Ala Ser Arg Leu Met - # Asn Lys Lys Ser Leu Ser Lys     Arg     #       22205     -      Phe Lys Ser Gly Ser Phe Lys Lys - # Lys Asn Leu Lys Gln Thr Cys     Lys     #  22405     -      Arg Val Ser Ile Pro Phe Leu Lys - # Ser Cys Leu Met Gly Gly Ile     Gly     # 22550     -      Cys Leu Val Ser Leu Lys Leu Leu - # Phe Ile Pro Leu Tyr Leu Tyr     Cys     #               22700 - #                2265     -      Leu Pro Leu Tyr Tyr Leu Leu Phe - # Phe Phe Lys Pro Met Asn Leu     Phe     #           22850     -      Leu Arg Leu Leu Leu Cys Leu Leu - # Leu Asp Ser Arg Lys Ile Ile     Gly     #       23005     -      Phe Ile Lys Glu Trp Ser Glu Arg - # Asn Leu Lys Asn Leu Phe Cys     Leu     #  23205     -      Arg Ala Lys Thr Lys Arg Ser Ala - # Phe Phe Pro Ser Leu Val Lys     Lys     # 23350     -      Trp Leu Met Thr Ser Thr Ile Gln - # Thr Glu Lys Thr Asp Leu Ala     Leu     #               23500 - #                2345     -      Gln Thr Pro Ile Gln Ile Thr Met - # Asn Val Phe Met Met Gly Ser     Leu     #           23650     -      Leu Thr Thr Ile Leu Phe Ala Gln - # Ser Asn Trp Gly Ala Leu Ile     Phe     #       23805     -      Pro Leu Pro Lys Lys Ile Ser Ser - # Tyr Thr Leu Leu Phe Ile Ala     Phe     #  24005     -      Leu Gly Ile Leu Leu Pro Leu Asn - # Ser Asn Ser Ile Phe Thr Leu     Leu     # 24150     -      Lys Arg Lys Ser Leu Leu Met Lys - # Pro Ile Gly Thr Met Val Leu     Phe     #               24300 - #                2425     -      Phe Leu Met Ile Ser Cys Glu Pro - # Ile Met Arg Ser Lys Arg Glu     Lys     #           24450     -      Val Phe Met Ile Leu Val Phe Phe - # Pro Ser Lys Ile Tyr Thr Leu     Ser     #       24605     -      Met Asn Pro Leu Ile Lys Ser Ile - # Leu Gln Thr Ile Ile Leu Lys     Ser     #  24805     -      Phe Lys Gly Leu Leu Glu Pro Ser - # Leu Lys Thr Ser Arg Ile Gly     Ser     # 24950     -      Ser Lys Ser Cys Ala Asn Thr Thr - # Pro Leu Asp Lys Asn Thr Leu     Lys     #               25100 - #                2505     -      Met Ala Leu Phe Thr Pro Asn Asn - # Ala Asn Phe Thr Ile Phe Leu     Trp     #           25250     -      Glu Met Lys Pro Leu Leu Phe Ala - # Thr Glu Lys Thr Cys Ile Ser     Arg     #       25405     -      Lys Lys Cys Met Val Gly Lys Lys - # Phe Ile Leu Pro Ile Ser Met     Glu     #  25605     -      Lys Ser Met Thr Ile Val Lys Asn - # Ile Leu Ala Leu Leu Arg Leu     Val     # 25750     -      Asn Thr Pro Leu Asn His Lys Ala - # Ile Cys Leu Ile Lys Ser Thr     Pro     #               25900 - #                2585     -      Thr Ala Asn Leu Phe Ser Cys Met - # Leu Ile Arg Leu Lys Thr His     Arg     #           26050     -      Phe Arg Thr Asn Trp Leu Ser Pro - # Leu Glu Glu Leu Leu Leu Val     Glu     #       26205     -      Ala Leu Lys Asn Arg Ala Leu Val - # Ala Ala Asn Trp Val Met Val     Ile     #  26405     -      Leu Arg Ala Val Met Val Ile Leu - # Cys Cys Leu Leu Ile Ala Leu     Lys     # 26550     -      Lys Asn Lys Ala Leu Arg Asn Ala - # Ser Leu Val Leu Thr Leu Lys     Val     #               26700 - #                2665     -      Phe Pro Thr Gln Arg Leu Ser Leu - # Trp Lys Ile Thr Phe Leu Pro     Asn     #           26850     -      Ile Ala Leu Leu Ser Arg Phe Leu - # Leu Phe Leu Met Leu Leu Ile     Ile     #       27005     -      Leu Leu Ile Ser Ser Leu Gly Leu - # Val Leu Met Ala Ile Lys Arg     Ile     #  27205     -      Thr Leu Gly Ala Ile Val Arg Lys - # Ala Arg Ser Ile Arg Leu Phe     Ile     # 27350     -      Thr Ser Thr Cys Pro Leu Ile Leu - # Val Gln Leu Gln Gln Asp Thr     Leu     #               27500 - #                2745     -      Tyr Leu Ala Gln Pro Val Gln Val - # Arg Gln Cys Leu Cys Gln Pro     Thr     #           27650     -      Leu Trp Asp Asn Leu Pro Ile Ile - # Phe Leu Leu Met Ser Ala Lys     Thr     #       27805     -      Ser Lys Ser Ser Leu Trp Phe Ile - # Trp Ile Lys Ile Met Ala Leu     Met     #  28005     -      Gly Ile Leu Ser Gln Trp Val Gly - # Ser Met Ser Arg Leu Ser Gly     Gln     # 28150     -      Ile Gln Asp Ile Leu Leu Leu Gly - # Arg Leu Val Cys Lys Lys Gln     Met     #               28300 - #                2825     -      Gln Gln Trp Ser Lys Asn Lys Gln - # Leu Phe Leu Leu Ser Lys Ser     Leu     #           28450     -      Lys Thr Gln Leu Lys Ala Met Lys - # Lys Met Lys Met Ala Thr Ala     Ser     #       28605     -      Leu Leu Ala Gln Ile Leu Ile Arg - # Leu Gln Arg Ser Asn Gln Pro     Tyr     #  28805     -      His Arg Lys Tyr Glu Ser Arg Leu - # Ser His His Ser Thr Tyr Cys     Phe     # 28950     -      Arg Glu Arg Pro Gln Ser Trp Ala - # Cys Arg Ala Ile Ser Ala Phe     Leu     #               29100 - #                2905     -      Gln Ile Asn Gln Trp Ile Ser Met - # Ala Phe Ser Asn Arg Ser Leu     Arg     #           29250     -      Phe Phe Lys Asn Asp Tyr Trp Arg - # Trp Val Lys Phe Leu Arg Gln     Cys     #       29405     -      Phe Ala Leu Tyr Leu Phe Leu Pro - # Phe Arg Ser Tyr Pro Arg Gly     Asn     #  29605     -      Gly Trp Ala Ile Cys Leu Arg Tyr - # Ser Leu Glu Ile Phe Ser Asp     Pro     # 29750     -      Lys Val Ala Tyr Phe Val Arg Asp - # Met Leu Lys Thr Ala Arg Lys     Arg     #               29900 - #                2985     -      Asn Ala Ile Val Arg Leu Ala Thr - # Gln Ser Ile Thr Asp Leu Leu     Ala     #           30050     -      Cys Pro Ile Ala Asp Thr Ile Arg - # Glu Gln Cys Pro Thr Lys Ile     Phe     #       30205     -      Leu Arg Asn Asp Gly Gly Asn Leu - # Ser Asp Tyr Gln Arg Leu Ala     Asn     #  30405     -      Val Thr Glu Lys Glu Phe Glu Ile - # Ile Thr Lys Gly Leu Asp Arg     Lys     # 30550     -      Ile Leu Tyr Lys Gln Asp Gly Ser - # Pro Ser Val Ile Ala Ser Phe     Asn     #               30700 - #                3065     -      Leu Arg Gly Ile Pro Lys Glu Tyr - # Leu Lys Ile Leu Ser Thr Asp     Thr     #           30850     -      Val Phe Val Lys Glu Ile Asp Lys - # Ile Ile Gln Asn His Ser Ile     Ile     #       31005     -      Asp Lys Tyr Gln Ala Leu Arg Gln - # Met Tyr Gln Gln Ile Glu Glu     Tyr     #  31205     -      Asn Glu Thr Lys Phe Ala Thr Lys - # Ile Met Glu Ile Leu Glu Asn     Asp     # 31350     -      Val Leu Thr Ile Leu Asp Ser Phe - # Ser Asn Tyr Leu Phe Glu Leu     Lys     #               31500 - #                3145     -      Glu Glu Leu Asp Phe Ile Glu Glu - # Glu Met Glu Gly Glu Ile Thr     Glu     #           31650     -      Gln     - (2) INFORMATION FOR SEQ ID NO:7:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 3287 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:7: (xi) SEQUENCE DESCRIPTION: SEQ     -      Ser Phe Val Tyr Ser Lys Met Gln - # Gln Thr Lys Asp Asn Glu Glu     Phe     #   15     -      Leu Asn Leu Met Asn Glu Lys Asn - # Pro Leu Ile Pro Leu Asn Lys     Ser     #                 30     -      Val Met Lys Ile Gly Val Phe Asp - # Ser Gly Val Gly Gly Phe Ser     Val     #             45     -      Leu Lys Ser Leu Leu Lys Ala Gln - # Leu Phe Asp Glu Ile Ile Tyr     Tyr     #         60     -      Gly Asp Ser Ala Arg Val Pro Tyr - # Gly Thr Lys Asp Pro Thr Thr     Ile     #     80     -      Lys Gln Phe Gly Leu Glu Ala Leu - # Asp Phe Phe Lys Pro His Gln     Ile     #   95     -      Glu Leu Leu Ile Val Ala Cys Asn - # Thr Ala Ser Ala Leu Ala Leu     Glu     #                110     -      Glu Met Gln Lys His Ser Lys Ile - # Pro Ile Val Gly Val Ile Glu     Pro     #            125     -      Ser Ile Leu Ala Ile Lys Arg Gln - # Val Lys Asp Lys Asn Ala Pro     Ile     #        140     -      Leu Val Leu Gly Thr Lys Ala Thr - # Ile Gln Ser Asn Ala Tyr Asp     Asn     #    160     -      Ala Leu Lys Gln Gln Gly Tyr Leu - # Asn Val Ser His Leu Ala Thr     Ser     #   175     -      Leu Phe Val Pro Leu Ile Glu Glu - # Ser Ile Leu Glu Gly Glu Leu     Leu     #                190     -      Glu Thr Cys Met Arg Tyr Tyr Phe - # Thr Pro Leu Lys Ile Phe Pro     Lys     #            205     -      Leu Phe Val Ala Arg Ile Phe Pro - # Ser Leu Lys Lys Leu Arg Ala     Ile     #        220     -      Leu Trp Ser Ile Leu Pro Phe Gln - # His Pro Pro Tyr Ser Ser Ile     Arg     #    240     -      Ala Met Leu Leu Asp Ile Cys Ser - # Lys Asn Thr Pro Leu Lys Lys     Met     #   255     -      His Thr His Ser Leu Lys Trp Asn - # Phe Met Arg Ala Ala Met Ser     Gly     #                270     -      Lys Asn Lys Leu Lys Asn Gly Ser - # Asn Cys Asn Ala Asn Lys Asn     Leu     #            285     -      Lys Ser Lys Leu Asn Ser Gln Lys - # Gly Ala Ile Ala His Thr Ile     Arg     #        300     -      Val Leu Cys Leu Ser Ala Leu Glu - # Arg Phe Met Lys Ala Phe Ser     Glu     #    320     -      Glu Cys Leu Met Ile Ile Phe Gly - # Lys Asp Tyr Leu Ser Thr Asp     Leu     #   335     -      Gln Asn Ser Ala Lys Asp Ile Leu - # Leu Ile Ala Ser Ile Leu Lys     Glu     #                350     -      Arg Leu Phe Thr His Lys Asn Glu - # Ile Phe Phe Cys Pro Arg Asn     Ser     #            365     -      His Ile Gln Ala Phe Arg Ile Gln - # Glu Arg Lys Ile Thr Thr Ser     Phe     #        380     -      His Gly Gly Ile Asn Asn Asn Ile - # Cys Leu Phe Thr Leu Lys Gly     Ile     #    400     -      His Ser Val Tyr Phe Glu Leu Ile - # Lys Ile Leu Glu Ala Val Phe     Phe     #   415     -      His Phe Ser Gln Ala Ser Phe Leu - # Gly Ile Ile Arg Cys Asp Asn     Ser     #                430     -      Tyr Gly Phe Ser Asp Phe Ser Leu - # Val Lys Ser Ala Ile Asn Leu     Leu     #            445     -      Lys Leu Tyr Asp Phe Leu Asn Leu - # Ile Phe Val Ile Ala Thr Cys     Phe     #        460     -      Phe Phe Met Leu Ser Leu Pro Leu - # Ile Asn Phe Arg Thr Ile Ala     Arg     #    480     -      Thr His His Asn Gln Thr Phe Ala - # Lys Gly Val Phe Leu Gly Glu     Ile     #   495     -      Ser Met Gln Thr Leu Lys Asn Lys - # Ala Phe Arg Cys Leu Ser Gly     Tyr     #                510     -      Phe Lys Val Lys Cys Pro Thr Ala - # Lys Asp Asp Trp Ala Ser Ala     Asp     #            525     -      Val Ala Arg Arg Glu Ser Leu Val - # Leu Thr Leu Cys Pro Tyr Asn     Arg     #        540     -      Asp Phe Thr Glu Phe Glu Leu Gln - # Arg Ile Pro Tyr Arg Ser His     Thr     #    560     -      Ser Tyr Leu Gln Arg Arg Asn Phe - # Ala His Phe Leu Arg Ile Arg     Met     #   575     -      Tyr Gly Asp Met Ile Pro Ile Gly - # Ser Ile His Pro Lys Tyr Arg     Arg     #                590     -      Leu Gly Phe Leu Leu Arg Ser Ile - # Lys Lys Pro Leu Ala Phe Ser     Lys     #            605     -      Gly Trp Phe Leu Arg Phe Trp Tyr - # Ser Leu Ile Leu Asp Phe Trp     Lys     #        620     -      Pro Pro Phe Val Leu Thr Phe Leu - # Ile Pro Cys Ser Ala Phe Ser     Leu     #    640     -      Ala Lys Gln Asp Ala Val Glu Asn - # Ala Leu Val Lys Asn Cys Val     Lys     #   655     -      Pro Glu Leu Val Ser Phe Thr Ala - # Leu Phe Asn Leu Val Glu Asn     Leu     #                670     -      Asn Glu Ser Glu Ser Phe Ile Phe - # Phe Trp Leu Lys Pro Ile Leu     Ser     #            685     -      Tyr Ser Asp Lys Gly Thr Leu Pro - # Thr Leu Cys Ala Thr Ile Leu     Ser     #        700     -      Phe Thr Ser Leu Ser His Ser Gly - # Phe Phe Cys Val Ser Met Pro     Val     #    720     -      Ala Phe Ser Leu Ile Ala Pro Phe - # Leu Ile Leu Leu Leu Met Arg     Met     #   735     -      Trp Leu Leu Val Ala Leu Ser Ser - # Phe Ala Asp Lys Leu Ala Gly     Thr     #                750     -      Phe Phe Ala Leu Ser Thr His Arg - # Phe Met Gly Leu Cys Phe Val     Glu     #            765     -      Ser Leu Ser Leu Ser Ile Val Cys - # Ser Arg Leu Pro Lys Lys Pro     Ala     #        780     -      Leu Ala Ser Asp Thr Ala Leu Arg - # Leu Ser Ile Phe Ala Asn Ser     Phe     #    800     -      Leu Glu Ser Pro Thr Leu Leu Arg - # Ser Ser Thr Leu Ser Cys Leu     Phe     #   815     -      Lys Gly Lys Pro Ala Ala Cys Pro - # Thr Thr Pro Lys Pro Leu Ala     Ile     #                830     -      Leu Ser Arg Ser Ile Phe Ala Phe - # Thr Phe Leu Ala Thr Ala Met     Gly     #            845     -      Ser Ser Arg Leu Ala Ala Cys Pro - # Ala Phe Phe Leu Leu Thr Leu     Ala     #        860     -      Met Gly Ser Phe Leu Ser Pro Leu - # Leu Leu Leu Leu Leu Lys Phe     Pro     #    880     -      Ser Phe Ala Phe Asn Ser Phe Leu - # Met Ser Glu Lys Phe Leu Glu     Arg     #   895     -      Val Val Ala Ser Ala Asp Asn Pro - # Leu Pro Thr Arg Val Pro Phe     Thr     #                910     -      Pro Phe Leu Thr Asp Tyr Ile Ser - # Asp Phe Phe Leu Ala Leu Arg     Leu     #            925     -      Ser Phe Phe Ala Cys Trp Ala Asn - # Cys Ser Phe Glu Lys Phe Leu     Arg     #        940     -      Ser Ala Asn Ala Cys Ser Thr Leu - # Leu Lys Ser Pro Val Ala Leu     Ala     #    960     -      Thr Asp Thr Ala Leu Arg Leu Ser - # Thr Leu Ser Val Thr Phe Leu     Met     #   975     -      Ile Thr Ser Leu Thr Glu Phe Ser - # Arg Ser Leu Phe Ala Cys Val     Thr     #                990     -      Leu Leu Lys Ser Leu Phe Leu Pro - # Phe Leu Asn Ser Phe Lys Ala     Ala     #           10050     -      Leu Arg Phe Ser Thr Phe Glu Ile - # His Ser Gly Leu Ile Pro Lys     Ser     #       10205     -      Phe Thr Glu Pro Leu Arg Ala Phe - # Ser Val Ser Ser Ala Leu Leu     Lys     #  10405     -      Ser Leu Phe Leu Pro Phe Leu Asn - # Ser Ser Lys Asp Leu Ser Lys     Ser     # 10550     -      Phe Lys Phe Leu Leu Thr Phe Ser - # Ser Leu Ser Asp Asn Ser Leu     Leu     #               10700 - #                1065     -      Met Pro Leu Arg Phe Ala Ala Ile - # Ala Leu Ala Ser Leu Leu Ala     Ser     #           10850     -      Leu Leu Ile Asn Ala Lys Ile Ser - # Ser Phe Trp Leu Leu Ala Ala     Phe     #       11005     -      Ala Ser Ile Leu Phe Leu Leu Pro - # Leu Leu Leu Ser Asn Phe Phe     Ser     #  11205     -      Thr Ser Phe Ser Lys Cys Ser Arg - # Phe Leu Arg Asp Phe Ser Arg     Ser     # 11350     -      Phe Ala Phe Phe Thr Ser Ser Leu - # Pro Val Phe Leu Ala Ser Ala     Thr     #               11500 - #                1145     -      Ala Leu Leu Lys Phe Lys Val Phe - # Pro Thr Asn Ser Leu Leu Leu     Leu     #           11650     -      Lys Lys Ser Leu Ile Ser Leu Leu - # Ala Ser Cys Gly Asp Asn Pro     Leu     #       11805     -      Val Val Ser Leu Ser Ser Lys Phe - # Arg Leu Thr Lys Leu Val Ile     Ala     #  12005     -      Arg Leu Phe Lys Ser Gly Lys Leu - # Lys Ile Ala Thr Leu Leu Lys     Pro     # 12150     -      Thr Ser Lys Trp Glu Thr Pro Phe - # Val Thr Pro Thr Pro Leu Leu     Gly     #               12300 - #                1225     -      Phe Leu Glu Ala Leu Val Tyr Leu - # Lys Leu Glu Ser Thr Asn Ile     Thr     #           12450     -      Pro Ser Cys Phe Arg Leu Pro Arg - # Val Thr Phe Phe Ser Leu Ser     Lys     #       12605     -      Ala Leu Ser Ala Phe Phe Pro Ser - # Leu Arg Val Leu Lys Ser Pro     Leu     #  12805     -      Pro Asn Ser Val Ile Lys Ala Glu - # Cys Phe Val Ser Phe Leu Glu     Thr     # 12950     -      Lys Ala Ile Arg Ser Phe Pro Arg - # Ala Ser Lys Ala Leu Glu Ser     Phe     #               13100 - #                1305     -      Trp Lys Ser Leu Ile Ser Val Arg - # Asn Phe Ser Phe Ser Phe Ser     Leu     #           13250     -      Lys Leu Ser Asn Leu Ala Leu Phe - # Cys Ala Arg Asn Ser Met Lys     Ser     #       13405     -      Ile Leu Phe Trp Ile Ser Ser Leu - # Asn Ala Arg Cys Glu Pro Val     Ser     #  13605     -      Trp Ser Ser Leu Arg Lys Leu Gly - # Leu Leu Ile Pro Phe Ser Pro     Pro     # 13750     -      Ala Met Thr Lys Pro Leu Pro Phe - # Phe Ile Cys Thr Leu Ile Ile     Val     #               13900 - #                1385     -      Ala Thr Leu Leu Pro Cys Trp Ser - # Leu Pro Thr Val Ala Phe Gln     Ser     #           14050     -      Cys Leu Ala Pro Gly Pro Pro Leu - # Pro Pro Tyr Asn Asn Glu Thr     Phe     #       14205     -      Ser Gly Ser Met Pro Leu Leu Pro - # Ile Asn Thr Glu Asp Arg Ala     Leu     #  14405     -      Leu Ile Asn Asn Leu Asn Leu Leu - # Gly Ser Met Ser Ala Thr Pro     Ser     # 14550     -      Thr Ser Asn Ile Ser Ile Ser Pro - # Arg Val Asn Leu Glu Lys Leu     Pro     #               14700 - #                1465     -      Leu Ser Ser Ser Lys Ser Leu Ala - # Ser Gly Gly Lys Pro Cys Met     Ser     #           14850     -      Val Val Val Val Ala Ile Ser Gly - # Trp Thr Trp Gly Thr Gly Ser     Leu     #       15005     -      Ile Ala Ser Leu Thr Ser Glu Asp - # Cys Phe Leu Ser Asn Ile Asn     Glu     #  15205     -      Arg Lys Ile Ser Asn Gln Ser Pro - # Pro Val Gly Ser Pro Phe Phe     Ser     # 15350     -      Ala Ser Cys Leu Ser Phe Lys Asp - # Ser Ser Asn Thr Pro Met Asn     Phe     #               15500 - #                1545     -      Ser Val Arg Ile Phe Pro Ile Met - # Ile Pro Ala Lys Asp Cys Leu     Ala     #           15650     -      Asp Phe Lys Asn Ser Ala Phe Ser - # Leu Ser Ser Arg Ile Gly Gly     Cys     #       15805     -      Met Ile Phe Ser Ile Lys Phe Arg - # Ile Asp Arg Val Leu Ile Leu     Asp     #  16005     -      Gly Ser Phe Trp Asp Thr Gln Leu - # Val Lys Ile Arg Arg Ser Pro     Asn     # 16150     -      Phe Lys Leu Phe Val Glu Asp Ser - # Thr Ser Met Arg Leu Ser Leu     Leu     #               16300 - #                1625     -      Ile Lys Ser Leu Leu Leu Leu Ile - # Lys Ser Glu Lys Tyr Phe Phe     Leu     #           16450     -      Val Gly Phe Leu Ile Ala Leu Leu - # Glu Tyr Ser Ser Leu Asn Cys     Glu     #       16605     -      Ile Pro Ser Lys Ala Cys Leu Leu - # Ser Leu Ser Phe Leu Ser Thr     Ile     #  16805     -      Gly Phe Ser Gly Ser Tyr Glu Ala - # Thr Ala Leu Ser Thr Leu Arg     Lys     # 16950     -      Ala Thr Arg Leu Leu Ile Asn Cys - # Cys Gly Leu Lys Ala Ala Ser     Val     #               17100 - #                1705     -      Cys Gly Cys Trp Ser Met Val Ser - # Leu Val Ile Val Ser Pro Tyr     Tyr     #           17250     -      Thr Phe His Thr Tyr Arg Tyr Leu - # Val Ser Val Lys Asp Cys Lys     Ser     #       17405     -      Leu His Ala Val Val Asn Ser Cys - # Leu Thr Phe Ser Leu Tyr Asn     Asn     #  17605     -      Ile Lys His Phe Phe Phe Phe Ser - # Leu Ile Ile Ile Asn Ile Glu     Lys     # 17750     -      Ser Pro Ile Val Ala Gly Val Arg - # Leu Gly Arg Phe Thr Phe Leu     Pro     #               17900 - #                1785     -      Thr Ile Asn Lys Asn Leu Leu Lys - # Lys Asn Phe Asn Lys Asn Lys     Lys     #           18050     -      Glu Met Asn Leu Phe Glu Glu Ile - # Lys Cys Asn Ala Lys Val Trp     Leu     #       18205     -      Glu Lys Trp Leu Val Lys Val Ser - # Thr Ser Val Leu Lys Ile Val     Trp     #  18405     -      Phe Val Cys His Ala Tyr Tyr Ser - # Met Tyr Asn Ala Phe Val Ile     His     # 18550     -      Lys Ile His Met Gly Asn Lys Met - # Glu Asn Lys Ser Ile Gly Gln     Ile     #               18700 - #                1865     -      Phe Lys Asp Ser Leu Lys Lys Ser - # Phe Phe Ser Gly Leu Trp Ser     Cys     #           18850     -      Leu Lys Trp Ser Phe Ile Leu Thr - # Leu Ile Ser Leu Gly Leu Phe     Leu     #       19005     -      Leu Val Phe Arg Phe Gln Pro Glu - # Thr Ile Lys Lys Tyr Ile Lys     Asp     #  19205     -      Pro Lys Asp Leu Gln Phe Tyr Asn - # Asp Leu Arg Lys Lys Asn Gly     Trp     # 19350     -      Asp Lys Val Tyr Tyr Ser Met Met - # Ser Val Ala Glu Gln Lys Arg     Arg     #               19500 - #                1945     -      Phe Phe Lys Leu Ser Ser Ala Cys - # Met Asn Ser Ile Lys Ile Leu     Phe     #           19650     -      Phe Leu Ile Tyr Ser Ser Asp Phe - # Gln Phe Glu Arg Lys Arg Met     Lys     #       19805     -      Phe Phe Thr Arg Ile Thr Asp Ser - # Tyr Lys Lys Val Val Val Thr     Leu     #  20005     -      Gly Leu Val Val Thr Thr Asn Pro - # Leu Met Ala Val Ala Ser Pro     Thr     # 20150     -      Glu Gly Val Thr Ala Thr Lys Gly - # Leu Val Ile Gln Ile Ile Ser     Val     #               20300 - #                2025     -      Leu Ala Ile Val Gly Gly Cys Ala - # Leu Gly Val Lys Gly Ile Ala     Asp     #           20450     -      Ile Trp Lys Ile Ser Asp Asp Ile - # Lys Arg Gly Gln Ala Thr Val     Phe     #       20605     -      Ala Tyr Ala Gln Pro Ile Ala Met - # Leu Ala Val Ala Gly Gly Ile     Ile     #  20805     -      Tyr Leu Ser Thr Lys Phe Gly Phe - # Asn Ile Gly Glu Gly Gly Gly     Ala     # 20950     -      Ser Val Asp Gln Gln Gln Glu Thr - # Lys Arg Leu Phe Phe Glu Ser     Ser     #               21100 - #                2105     -      Leu Lys Ser Arg Cys Phe Gln Phe - # Val Trp Val Ser Lys Gln Arg     Ser     #           21250     -      Gln Lys Arg Ser Thr Arg Lys Arg - # Lys Lys His Ser Gln Trp Ala     Cys     #       21405     -      Leu Tyr Lys Phe Arg Phe Phe Gln - # Ser Asp Tyr Gln Lys Phe Glu     Arg     #  21605     -      Gln Glu Ser Asn Phe Gln Arg Ser - # Gln Ser Arg Tyr Tyr Gln Arg     Ser     # 21750     -      Phe Leu Arg Asp Cys Gln Ser Pro - # Lys Asn Lys Gly Phe Glu Ala     Leu     #               21900 - #                2185     -      Arg Ser Ser Tyr Lys Asn Leu Lys - # Gly Gly Asn Gly Ile Glu Cys     Arg     #           22050     -      Ile Thr Asp Gln Ser Ser Glu Arg - # Leu Phe Tyr Phe Gln Lys His     Gln     #       22205     -      His Asp Gln Gln Thr Thr Leu Leu - # Ser Val Ile Ala Arg Arg Asn     Lys     #  22405     -      Arg Asn Ser Met Arg Val Ile Cys - # Phe Ile Gly Arg Ile Gln Phe     Leu     # 22550     -      Ser Ser Phe Glu Thr Tyr His Arg - # Arg Val Ser Gln Lys Asn Leu     Leu     #               22700 - #                2265     -      Tyr Asp Tyr Met Ser Gly Ile Ser - # Ile Lys Arg Glu Val Phe Val     Ala     #           22850     -      Ser Lys Gln Ala Asp Glu Gln Lys - # Lys Leu Ile Ile Glu Gln Glu     Val     #       23005     -      Gln Lys Arg Gln Phe Gln Lys Ile - # Glu Glu Leu Lys Ala Asp Met     Gln     #  23205     -      Lys Gly Val Asn Pro Phe Phe Lys - # Val Leu Phe Asp Gly Gly Asn     Arg     # 23350     -      Leu Phe Gly Phe Pro Glu Thr Phe - # Ile Tyr Ser Ser Ile Phe Ile     Leu     #               23500 - #                2345     -      Phe Val Thr Ile Val Leu Ser Val - # Ile Leu Phe Gln Ala Tyr Glu     Pro     #           23650     -      Val Leu Ile Val Ala Ile Val Ile - # Val Leu Val Ala Leu Gly Phe     Lys     #       23805     -      Lys Asp Tyr Arg Leu Tyr Gln Arg - # Met Glu Arg Ala Met Lys Phe     Lys     #  24005     -      Lys Pro Phe Leu Phe Lys Gly Val - # Lys Asn Lys Ala Phe Met Ser     Ile     # 24150     -      Phe Ser Met Lys Pro Ser Lys Glu - # Met Ala Asn Asp Ile His Leu     Asn     #               24300 - #                2425     -      Pro Asn Arg Glu Asp Arg Leu Val - # Ser Ala Ala Asn Ser Tyr Leu     Ala     #           24450     -      Asn Asn Tyr Glu Cys Phe Leu Asp - # Asp Gly Val Ile Leu Thr Asn     Asn     #       24605     -      Tyr Ser Leu Leu Gly Thr Ile Lys - # Leu Gly Gly Ile Asp Phe Leu     Thr     #  24805     -      Thr Ser Lys Lys Asp Leu Ile Glu - # Leu His Ala Ser Ile Tyr Ser     Val     # 24950     -      Phe Arg Asn Phe Val Thr Pro Glu - # Phe Lys Phe Tyr Phe His Thr     Ile     #               25100 - #                2505     -      Lys Lys Lys Ile Val Ile Asp Glu - # Thr Asn Arg Asp Tyr Gly Leu     Ile     #           25250     -      Phe Ser Asn Asp Phe Met Arg Ala - # Tyr Asn Glu Lys Gln Lys Arg     Glu     #       25405     -      Ser Phe Tyr Asp Ile Ser Phe Phe - # Leu Thr Ile Glu Gln Asp Leu     Leu     #  25605     -      Asp Thr Leu Asn Glu Pro Val Met - # Asn Lys Lys His Phe Ala Asp     Asn     # 25750     -      Asn Phe Glu Glu Phe Gln Arg Ile - # Ile Arg Ala Lys Leu Glu Asn     Phe     #               25900 - #                2585     -      Lys Asp Arg Ile Glu Leu Ile Glu - # Glu Leu Leu Ser Lys Tyr His     Pro     #           26050     -      Thr Arg Leu Lys Glu Tyr Thr Lys - # Asp Gly Val Ile Tyr Ser Lys     Gln     #       26205     -      Cys Glu Phe Tyr Asn Phe Leu Val - # Gly Met Asn Glu Ala Pro Phe     Ile     #  26405     -      Cys Asn Arg Lys Asp Leu Tyr Leu - # Lys Glu Lys Met His Gly Gly     Val     # 26550     -      Lys Glu Val Tyr Phe Ala Asn Lys - # His Gly Lys Ile Leu Asn Asp     Asp     #               26700 - #                2665     -      Leu Ser Glu Lys Tyr Phe Ser Ala - # Ile Glu Ile Ser Glu Tyr Ala     Pro     #           26850     -      Lys Ser Gln Ser Asp Leu Phe Asp - # Lys Ile Asn Ala Leu Asp Ser     Glu     #       27005     -      Phe Ile Phe Met His Ala Tyr Ser - # Pro Lys Asn Ser Gln Val Leu     Lys     #  27205     -      Asp Lys Leu Ala Phe Thr Ser Arg - # Arg Ile Ile Ile Ser Gly Gly     Ser     # 27350     -      Lys Glu Gln Gly Met Thr Leu Gly - # Cys Leu Ser Glu Leu Val Gly     Asn     #               27500 - #                2745     -      Gly Asp Ile Thr Leu Gly Ser Tyr - # Gly Asn Ser Leu Val Leu Phe     Ala     #           27650     -      Asp Ser Phe Glu Lys Met Lys Gln - # Ser Val Lys Glu Cys Val Ser     Ser     #       27805     -      Leu Asn Ala Lys Gly Phe Leu Ala - # Asn Ala Ala Thr Phe Ser Met     Glu     #  28005     -      Asn Tyr Phe Phe Ala Lys His Cys - # Ser Phe Ile Thr Leu Pro Phe     Ile     # 28150     -      Phe Asp Val Thr Ser Asn Asn Phe - # Ala Asp Phe Ile Ala Met Arg     Ala     #               28300 - #                2825     -      Met Ser Phe Asp Gly Asn Gln Glu - # Asn Asn Ala Trp Gly Asn Ser     Val     #           28450     -      Met Thr Leu Lys Ser Glu Ile Asn - # Ser Pro Phe Tyr Leu Asn Phe     His     #       28605     -      Met Pro Thr Asp Phe Gly Ser Ala - # Ser Ala Gly His Thr Leu Ile     Leu     #  28805     -      Gly Ser Thr Gly Ser Gly Lys Thr - # Val Phe Met Ser Met Thr Leu     Asn     # 28950     -      Ala Met Gly Gln Phe Ala Tyr Asn - # Phe Pro Ala Asn Val Ser Lys     Asp     #               29100 - #                2905     -      Lys Gln Lys Leu Thr Met Val Tyr - # Met Asp Lys Asp Tyr Gly Ala     Tyr     #           29250     -      Gly Asn Ile Val Ala Met Gly Gly - # Glu Tyr Val Lys Ile Glu Leu     Gly     #       29405     -      Thr Asp Thr Gly Leu Asn Pro Phe - # Ala Trp Ala Ala Cys Val Gln     Lys     #  29605     -      Thr Asn Ala Thr Met Glu Gln Lys - # Gln Thr Ala Ile Ser Val Val     Lys     # 29750     -      Glu Leu Val Lys Asn Leu Ala Thr - # Lys Ser Asp Glu Lys Asp Glu     Asn     #               29900 - #                2985     -      Gly Asn Ser Ile Ser Phe Ser Leu - # Ala Asp Ser Asn Thr Leu Ala     Ala     #           30050     -      Gln Pro Thr Leu Ser Gln Glu Ile - # Ile Ile Ile Pro Ser Leu Asn     Leu     #       30205     -      Leu Met Leu Ser Gly Lys Thr Thr - # Met Ile Leu Met Gly Leu Ser     Arg     #  30405     -      Asp Arg Leu Phe Ala Asn Gln Pro - # Met Val Asn Phe Asn Gly Phe     Leu     # 30550     -      Ile Ile Lys Gln Gln Ile Ala Ile - # Phe Gln Lys Arg Leu Leu Ala     Leu     #               30700 - #                3065     -      Met Gly Gln Val Ser Thr Ile Met - # Met Phe Arg Pro Leu Phe Val     Phe     #           30850     -      Thr Phe Ser Leu Val Ser Lys Arg - # Gln Trp Met Gly Val Asp Leu     Ser     #       31005     -      Ile Leu Met Lys Pro Gly Asn Ile - # Arg Ser Lys Gly Arg Leu Phe     Cys     #  31205     -      Lys Arg His Ala Lys Asn Cys Lys - # Glu Lys Lys Arg Tyr Cys Gln     Thr     # 31350     -      Cys Asp Ser Lys His His Ser Phe - # Gly Leu Pro Tyr Cys Tyr Asp     Arg     #               31500 - #                3145     -      Thr Met Pro Tyr Lys Asp Phe Phe - # Glu Lys Arg Trp Gly Gln Ser     Phe     #           31650     -      Leu Pro Lys Ile Gly Cys Tyr Arg - # Lys Arg Ile Asn His His Gly     Thr     #       31805     -      Arg Glu Asn Pro Leu Gln Thr Gly - # Trp Lys Pro Arg Tyr Arg Phe     Phe     #  32005     -      Glu Arg His Ser Arg Ile Phe Glu - # Asn Phe Ile His Arg Tyr Cys     Ile     # 32150     -      Cys Gln Arg Asn Asp Tyr Pro Lys - # Pro Tyr His Arg Ile Ser Gly     Leu     #               32300 - #                3225     -      Glu Thr Asn Val Ser Thr Asn Arg - # Gly Val Leu Lys Asn Lys Ile     Cys     #           32450     -      Val Asn Lys Asn Tyr Gly Asn Phe - # Arg Lys Cys Leu Asp Asp Phe     Gly     #       32605     -      Phe Phe Leu Ser Phe Thr Glu Arg - # Arg Ile Gly Leu His Arg Arg     Arg     #  32805     -      Asn Gly Arg Arg Asn His Thr     #   3285     - (2) INFORMATION FOR SEQ ID NO:8:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 3200 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:8: (xi) SEQUENCE DESCRIPTION: SEQ     -      Lys Leu Cys Leu Phe Asn Ala Thr - # Asn Gly Gln Arg Ile Phe Lys     Leu     #   15     -      Asn Glu Lys Ile Lys Ser Leu Ile - # Asn Pro Ile Lys Leu Lys Lys     Cys     #                 30     -      His Glu Asn Arg Arg Phe Arg Cys - # Gly Arg Val Arg Phe Lys Lys     Pro     #             45     -      Phe Lys Ser Ala Ile Ile Asn His - # Leu Leu Trp Arg Arg Ser Ala     Leu     #         60     -      Trp His Arg Pro His Tyr Asp Gln - # Ala Ile Trp Leu Arg Gly Phe     Gly     #     80     -      Phe Phe Gln Thr Ala Pro Asp Ile - # Ile Asp Cys Gly Met Gln His     Ser     #   95     -      Glu Arg Ser Ser Phe Arg Arg Asp - # Ala Lys Ala Phe Gln Asn Pro     Tyr     #                110     -      Cys Gly Arg Asp Ala Lys His Phe - # Ser Asp Gln Ala Thr Ser Lys     Arg     #            125     -      Lys Arg Pro Tyr Phe Ser Ala Arg - # Asp Lys Ser Asp Asp Pro Ile     Gln     #        140     -      Arg Leu Gln Cys Pro Glu Thr Thr - # Arg Leu Phe Glu Cys Phe Ala     Phe     #    160     -      Ser His Phe Ser Phe Cys Ala Phe - # Asp Arg Lys Tyr Phe Arg Gly     Arg     #   175     -      Ile Val Arg Asn Leu His Ala Leu - # Leu Phe His Ser Leu Lys Asp     Phe     #                190     -      Pro Glu Val Ile Ile Leu Gly Cys - # Thr His Phe Pro Leu Ile Ala     Gln     #            205     -      Lys Ile Glu Gly Tyr Phe Met Glu - # His Phe Ala Leu Ser Thr Pro     Pro     #        220     -      Leu Leu Ile His Ser Gly Asp Ala - # Ile Val Gly Tyr Leu Gln Gln     Lys     #    240     -      Tyr Ala Leu Lys Lys Asn Ala His - # Ala Phe Pro Lys Val Glu Phe     His     #   255     -      Ala Ser Gly Asp Val Ile Trp Leu - # Glu Lys Gln Ala Lys Glu Trp     Leu     #                270     -      Lys Leu Arg Lys Phe Lys Val Lys - # Ile Thr Gln Phe Pro Lys Arg     Gly     #            285     -      Tyr Ser Ser His Tyr Lys Gly Ile - # Val Leu Ile Ile Lys Arg Phe     Arg     #        300     -      Glu Ile Tyr Glu Ser Val Leu Lys - # Arg Arg Met Ser Asn Asp Tyr     Leu     #    320     -      Trp Lys Arg Leu Pro Ile Tyr Arg - # Leu Ala Lys Arg Arg Tyr Ser     Ser     #   335     -      His Arg Phe Val Asn Ser Gln Arg - # Lys Thr Phe Tyr Pro Gln Lys     Asp     #                350     -      Phe Leu Leu Pro Lys Pro His Ser - # Ser Val Asn Leu Ala Arg Lys     Lys     #            365     -      Asp Tyr His Lys Phe Ser Arg Trp - # Asn Lys Tyr Leu Pro Leu His     Leu     #        380     -      Glu Arg His Pro Gln Cys Leu Phe - # Ala His Gln Asn Ser Ser Arg     Ile     #    400     -      Phe Pro Leu Leu Ile Ala Ser Val - # Phe Phe Gly His Tyr Lys Val     Phe     #   415     -      Leu Trp Phe Phe Phe Phe Phe Gly - # Glu Arg Asp Gln Phe Ala Leu     Lys     #                430     -      Ala Leu Phe Phe Glu Leu Asp Leu - # Cys Tyr Ser Tyr Leu Phe Phe     Leu     #            445     -      Val Tyr Ala Leu Phe Ile Thr Leu - # Asn Phe Ser His Tyr Ser Thr     Asn     #        460     -      Pro Leu Ala Gln Ser Asn Leu Ile - # Leu Ser Lys Gly Leu Ser Leu     Phe     #    480     -      Gly Val Gly Asn Ile His Ala Asp - # Phe Lys Glu Gln Ser Leu Ser     Leu     #   495     -      Val Ser Val Gly Ile Phe Gly Glu - # Val Pro Asn Arg Arg Arg Leu     Gly     #                510     -      Phe Leu Gly Cys Cys Pro Gly Glu - # Phe Gly Ser His Ser Val Ser     Leu     #            525     -      Gly Phe Tyr Arg Val Ala Pro Thr - # His Ser Leu Gln Val Ser His     Ile     #        540     -      Ile Ser Pro Lys Ala Leu Cys Thr - # Leu Pro Thr Leu Asp Thr Asn     Val     #    560     -      Trp Gly Tyr Asp Thr Met Asn Trp - # His Ser Ser Gln Ile Leu Lys     Thr     #   575     -      Ile Gly Ile Ser Ala Lys Glu Tyr - # Lys Thr Pro Arg Phe Gln Gly     Val     #                590     -      Val Phe Ala Phe Leu Val Phe Leu - # Asn Pro Leu Arg Phe Leu Glu     Thr     #            605     -      Thr Phe Cys Ile Asn Val Leu Asp - # Ser Met Leu Arg Ile Phe Ser     Arg     #        620     -      Gln Ala Val Ile Arg Cys Cys Arg - # Lys Cys Ile Gly Lys Leu Arg     Lys     #    640     -      Ala Arg Ile Ser Val Phe Tyr Ser - # Ile Val Gln Leu Gly Gly Lys     Leu     #   655     -      Glu Arg Ile Arg Ile Ile Phe His - # Ile Leu Leu Val Glu Ala Asn     Phe     #                670     -      Ile Ile Leu Gln Arg Asn Ala Ser - # Tyr Ile Met Arg Asn Tyr Leu     Ile     #            685     -      Ile His Glu Leu Glu Pro Leu Arg - # Val Phe Leu Arg His Ala Gly     Arg     #        700     -      Phe Phe Ile Asp Cys Ser Ile Phe - # Asp Ile Ala Ile Asn Ala Tyr     Val     #    720     -      Ala Val Ser Ser Val Ile Val Phe - # Arg Gln Thr Ser Arg Tyr Phe     Phe     #   735     -      Cys Thr Phe Asn Pro Ile His Gly - # Ile Val Phe Cys Arg Ile Phe     Glu     #                750     -      Leu Val Tyr Arg Leu Leu Ile Ala - # Lys Lys Thr Cys Phe Ser Phe     Tyr     #            765     -      Arg Leu Ile Glu Ile Val Asn Phe - # Leu Ser Gln Phe Leu Ile Pro     Lys     #        780     -      Pro Tyr Leu Thr Glu Ile Ile Asn - # Phe Ile Met Pro Phe Gln Arg     Glu     #    800     -      Ala Arg Cys Leu Pro Tyr Asn Thr - # Gln Thr Thr Cys Tyr Leu Ile     Glu     #   815     -      Ser Val Asn Phe Cys Ile Tyr Leu - # Phe Ser Asn Leu Ser Val Asn     Gly     #                830     -      Phe Phe Lys Ala Ser Cys Leu Pro - # Cys Phe Leu Phe Ile Asn Phe     Ser     #            845     -      Ile Asn Gly Phe Val Phe Glu Ser - # Ile Ile Val Ile Val Ile Glu     Ile     #        860     -      Ser Lys Phe Cys Ile Gln Leu Phe - # Leu Asp Val Arg Lys Val Phe     Arg     #    880     -      Lys Ser Cys Gly Phe Cys Leu Arg - # Pro Ile Thr Asp Gly Ser Ile     His     #   895     -      Thr Ile Leu Asn Gly Leu Ile Tyr - # Phe Arg Phe Phe Ser Ser Ile     Glu     #                910     -      Thr Phe Ile Phe Leu Ser Leu Leu - # Gly Gln Leu Leu Leu Glu Ile     Phe     #            925     -      Glu Ile Gly Arg Leu Leu Tyr Pro - # Thr Glu Ile Thr Arg Cys Phe     Ser     #        940     -      His Tyr Arg Leu Ile Glu Ile Ile - # Asn Phe Ile Arg Tyr Leu Leu     Ile     #    960     -      Asp Asp His Ile Phe Asn Gly Ile - # Phe Lys Val Ala Phe Cys Leu     Arg     #   975     -      Tyr Leu Ala Glu Ile Leu Ile Phe - # Ala Ile Phe Glu Phe Ile Gln     Ser     #                990     -      Cys Ile Lys Val Phe Asn Phe Asn - # Pro Phe Trp Ile Asp Thr Ile     Phe     #           10050     -      His Arg Thr Phe Lys Gly Phe Cys - # Phe Phe Cys Leu Ala Glu Ile     Leu     #       10205     -      Ile Phe Ala Ile Phe Glu Phe Ile - # Lys Arg Phe Ile Lys Val Phe     Gln     #  10405     -      Ile Leu Val Asp Ile Phe Lys Phe - # Ile Arg Gln Phe Pro Phe Asp     Ala     # 10550     -      Phe Lys Ile Leu Ser Val Ser Asp - # Cys Ser Cys Val Ser Ile Ser     Leu     #               10700 - #                1065     -      Phe Ile Asp Gln Arg Lys Asn Leu - # Ile Phe Leu Ala Val Ser Leu     Ser     #           10850     -      Phe Cys Phe His Phe Ile Phe Val - # Ala Ala Phe Ala Leu Gln Phe     Phe     #       11005     -      Leu Tyr Phe Phe Leu Met Leu Ser - # Phe Pro Arg Phe Phe Lys Ile     Phe     #  11205     -      Leu Ser Phe Phe His Phe Ile Ile - # Ile Ala Cys Val Phe Ser Val     Ser     # 11350     -      Tyr Ser Phe Ile Glu Val Ser Phe - # Ser Asn Gln Phe Phe Val Ala     Ala     #               11500 - #                1145     -      Gln Lys Ile Phe Asp Lys Leu Ile - # Ser Phe Leu Trp Gly Gln Ser     Phe     #           11650     -      Ser Gly Phe Ile Leu Ile Pro Pro - # Tyr Glu Thr Ser Asp Ser Glu     Ile     #       11805     -      Ile Ile Arg Gln Ile Lys Asp Ser - # Tyr Leu Val Lys Ala Tyr Phe     Met     #  12005     -      Gly Asn Ala Ile Arg Asn Ala Tyr - # Thr Leu Ile Gly Ile Leu Gly     Gly     # 12150     -      Val Gly Val Phe Glu Ile Arg Ile - # Ile Asn Lys His His Ala Ile     Met     #               12300 - #                1225     -      Phe Ala Thr Leu Lys Ser Asn Ile - # Phe Leu Pro Ile Ser Phe Ile     Cys     #           12450     -      Phe Phe Pro Ile Ile Phe Glu Ser - # Val Ala Gln Ile Pro Ile Thr     Lys     #       12605     -      Leu Ser Asn Ser Met Phe Cys Val - # Phe Phe Arg Asn Lys Ser Asn     Thr     #  12805     -      Ile Ile Pro Gly Val Ile Ser Leu - # Arg Val Phe Leu Glu Ile Phe     Asn     # 12950     -      Leu Ser Ser Glu Phe Phe Leu Leu - # Phe Leu Ala Gln Val Val Phe     Ser     #               13100 - #                1305     -      Ile Ile Leu Cys Lys Lys Phe His - # Glu Ile Tyr Phe Val Leu Asp     Leu     #           13250     -      Phe Leu Thr Cys Ser Leu Ala Cys - # Glu Leu Val Phe Phe Val Glu     Ile     #       13405     -      Lys Thr Arg Val Val Asn Pro Phe - # Leu Thr Thr Cys Tyr Asp Ala     Thr     #  13605     -      Ala Val Phe His Met His Ile Asn - # Tyr Cys Ser His Ile Val Ala     Leu     # 13750     -      Leu Val Phe Ile Thr Asn Gly Gly - # Val Pro Ile Met Pro Ser Ser     Arg     #               13900 - #                1385     -      Thr Thr Ile Ala Pro Ile Gln Gln - # Asn Phe Phe Arg Phe Tyr Ala     Ile     #           14050     -      Met Thr Pro His His Arg Arg Gln - # Ser Val Ile Val Asn Gln Leu     Ile     #       14205     -      Glu Leu Val Ile Gly Ile Asn Val - # Ser Asp Ser Leu Asn Ile His     Phe     #  14405     -      His Ile Ala Lys Ser Glu Phe Arg - # Lys Ile Thr Pro Phe Ile Lys     Ile     # 14550     -      Ser Ser Phe Arg Arg Ala Leu Tyr - # Val Gly Gly Gly Ser Gly Tyr     Ile     #               14700 - #                1465     -      Trp Leu Asp Met Gly Asn Trp Phe - # Leu Ile Asp Cys Phe Phe Asp     Ile     #           14850     -      Arg Arg Leu Phe Phe Val Lys Tyr - # Lys Glu Lys Asn Ile Gln Pro     Ile     #       15005     -      Pro Thr Ser Arg Leu Ser Ile Phe - # Phe Cys Phe Leu Pro Phe Phe     Gln     #  15205     -      Gly Leu Ile Lys His Ala His Glu - # Leu Leu Ile Arg Ser Asp Leu     Ile     # 15350     -      Pro Tyr Asn Asp Ser Cys Lys Arg - # Leu Phe Gly Arg Phe Gln Lys     Leu     #               15500 - #                1545     -      Arg Phe Leu Phe Ile Ile Lys Asp - # Arg Gly Leu Tyr Asp Ile Phe     His     #           15650     -      Lys Ile Ser Asp Arg Ser Gly Val - # Asp Phe Arg Arg Ile Val Leu     Met     #       15805     -      Gly His Pro Thr Cys Glu Asn Ser - # Val Thr Leu Ile Pro Lys Phe     Leu     #  16005     -      Lys Ala Leu Cys Gly Arg Phe Tyr - # Ile Asn Glu Ile Val Phe Val     Asp     # 16150     -      Ile Ile Ala Leu Ile Asp Lys Val - # Lys Ile Leu Ile Leu Phe Gly     Arg     #               16300 - #                1625     -      Ile Phe Asp Arg Phe Ile Gly Val - # Phe Phe Pro Leu Arg Asp Ser     Phe     #           16450     -      Lys Ser Leu Pro Val Ile Pro Ile - # Val Leu Ile Asn Asp Trp Phe     Leu     #       16605     -      Ile Arg Ile Val Ser Asp Ser Val - # Ile Asn Phe Lys Lys Ser Tyr     Leu     #  16805     -      Lys Ile Ile Asp Lys Leu Leu Arg - # Val Lys Ser Arg Phe Gly Leu     Trp     # 16950     -      Leu Leu Val Asn Gly Phe Val Ser - # His Cys Phe Ser Leu Leu Tyr     Leu     #               17100 - #                1705     -      Val Ser Tyr Leu Ser Val Ser Cys - # Cys Gln Arg Leu Leu Lys Ile     Ala     #           17250     -      Thr Arg Leu Ser Ser Gln Phe Met - # Ser Leu Asn Ile Leu Ile Ile     Gln     #       17405     -      Gln Tyr Gln Ala Phe Phe Phe Phe - # Phe Ile Asn His Tyr Tyr Lys     Lys     #  17605     -      Pro Asp Arg Cys Arg Arg Leu Lys - # Ala Arg Thr Ile His Ile Phe     Thr     # 17750     -      His Asn Gln Lys Phe Ala Lys Met - # Lys Glu Leu Glu Gln Glu Ile     Arg     #               17900 - #                1785     -      Asn Glu Ser Phe Arg Asn Gln Met - # Gln Arg Ser Met Val Gly Glu     Met     #           18050     -      Val Ser Gln Gly Glu His Met Lys - # Arg Ser Lys Asn Ser Leu Val     Cys     #       18205     -      Val Ser Cys Leu Leu His Val Leu - # Ser Val Cys Asp Pro Gln Asp     Pro     #  18405     -      His Gly Lys Gln Asn Gly Lys Gln - # Ile Asn Arg Thr Asp Phe Gln     Arg     # 18550     -      Pro Gln Lys Lys Phe Leu Trp Ser - # Met Glu Leu Leu Lys Met Glu     Leu     #               18700 - #                1865     -      Tyr Ser His Ser Asp Gln Leu Gly - # Phe Val Phe Ala Cys Phe Val     Ser     #           18850     -      Thr Asp Asp Lys Ile His Gln Arg - # Ser Arg Ser Thr Ile Leu Gln     Arg     #       19005     -      Leu Glu Lys Glu Lys Trp Leu Gly - # Gln Val Gly Leu Leu Phe Leu     Asn     #  19205     -      Asp Val Ser Cys Arg Ala Lys Ala - # Ser Val Leu Ile Ile Ile Lys     Arg     # 19350     -      Leu Tyr Glu Tyr Lys Asn Thr Phe - # Phe Phe Asp Ile Leu Lys Arg     Leu     #               19500 - #                1945     -      Ile Ser Ile Lys Glu Thr His Glu - # Ile Phe Tyr Lys Asn His Gln     Leu     #           19650     -      Gln Glu Ser Cys Ser Asn Phe Arg - # Ala Ser Gly Asn Asn Gln Ser     Phe     #       19805     -      Asn Gly Gly Arg Gln Ser Tyr Arg - # Arg Arg His Cys Asn Arg Phe     Gly     #  20005     -      Tyr Ser Asp His Phe Cys Ser Ser - # Asp Arg Arg Trp Leu Arg Phe     Arg     # 20150     -      Gly Gln Arg His Ser Gly Tyr Leu - # Glu Asn Leu His Lys Arg Ser     Gly     #               20300 - #                2025     -      Asp Cys Phe Cys Leu Arg Ala Thr - # His Ser Tyr Val Ser Gly Gly     Arg     #           20450     -      Arg Tyr Tyr Leu Phe Glu His Val - # Trp Leu Gln Tyr Trp Arg Gly     Trp     #       20605     -      Arg Ser Leu Ser Ser Thr Ile Ile - # Ile Ile Ala Ile Arg Asn Glu     Ala     #  20805     -      Phe Phe Lys Phe Ser Val Ser Leu - # Phe Ser Val Arg Met Gly Gln     Met     # 20950     -      Met Thr Lys Lys Pro Lys Lys Lys - # His Lys Lys Lys Lys Lys Thr     Leu     #               21100 - #                2105     -      Pro Met Gly Leu Phe Ile Gln Ile - # Ile Leu Ile Val Ser Lys Arg     Leu     #           21250     -      Ser Lys Ile Lys Thr Arg Lys Leu - # Ser Lys Lys Ser Ile Pro Ile     Leu     #       21405     -      Ser Lys Met Lys Phe Leu Thr Ser - # Leu Ser Ile Glu Ser Leu Lys     Lys     #  21605     -      Arg Ile Ser Ile Thr Ile Gln Leu - # Leu Lys Lys Ser Leu Met Lys     Arg     # 21750     -      Val Arg Lys Trp Asp Met Asn Tyr - # Arg Ser Ile Leu Lys Lys Thr     Phe     #               21900 - #                2185     -      Leu Leu Ser Lys Ala Ser Ala Arg - # Pro Thr Asn Asn Ala Ala Phe     Cys     #           22050     -      His Cys Ala Glu Lys Gln Glu Lys - # Phe Tyr Ala Met Ile Ser Tyr     Ile     #       22205     -      Met Phe Tyr Trp Pro Tyr Ser Ile - # Leu Met Ile Leu Met Ile Phe     Asn     #  22405     -      Ile Leu Ala Pro Ser Leu Ser Lys - # Lys Ser Phe Ile Arg Leu His     Val     # 22550     -      Lys Arg Tyr Ile Tyr Lys Glu Arg - # Gly Val Cys Gly Lys Gln Ala     Gly     #               22700 - #                2265     -      Thr Lys Lys Ala Asn Tyr Arg Ala - # Arg Gly Ser Lys Ala Ala Val     Ser     #           22850     -      Lys Asn Arg Arg Thr Ser Arg His - # Ala Lys Gly Cys Gln Ser Leu     Phe     #       23005     -      Ser Leu Val Trp Gly Glu Val Val - # Trp Phe Pro Asn Phe Tyr Leu     Phe     #  23205     -      Leu Tyr Ile Tyr Ile Val Cys Asn - # His Cys Ile Ile Cys Tyr Ser     Phe     # 23350     -      Ser Ser Leu Thr Cys Phe Asp Cys - # Ser Asp Cys Tyr Cys Ala Cys     Ser     #               23500 - #                2345     -      Ser Trp Ile Gln Glu Arg Leu Ala - # Leu Ser Lys Asn Gly Ala Ser     Asp     #           23650     -      Glu Ile Lys Thr Phe Phe Val Gly - # Arg Glu Lys Gln Ser Val His     Glu     #       23805     -      His Phe Phe His Glu Ala Arg Asn - # Gly His Pro Leu Lys Ser Lys     Gln     #  24005     -      Arg Arg Gln Thr Cys Glu Arg Cys - # Lys Leu Leu Ser Ser Lys Leu     Met     # 24150     -      Phe Phe Arg Trp Gly Asp Pro Tyr - # Gln Leu Phe Ser Phe Arg His     Asn     #               24300 - #                2425     -      Gln Ile Gly Gly His Phe Phe Asn - # His Phe Gln Lys Arg Ser His     Arg     #           24450     -      Val Thr Arg Phe Tyr Leu Arg Phe - # Glu Phe Cys Tyr Pro Ile Gln     Ile     #       24605     -      Leu Phe Ser His Tyr Lys Glu Asn - # Arg Tyr Asn Gln Gly Leu Trp     Ser     #  24805     -      Tyr Phe Phe Phe His Ala Ser Leu - # Glu Ala Lys Glu Arg Lys Phe     Leu     # 24950     -      Tyr Phe Phe Ser Asp His Arg Ala - # Arg Phe Ile Arg His Ser Gln     Thr     #               25100 - #                2505     -      Arg Tyr Glu Lys Ala Phe Cys Arg - # Gln Phe Arg Val Ser Lys Asp     Tyr     #           25250     -      Ser Gln Ala Lys Leu Gln Gly Asp - # Arg Ala His Arg Arg Ala Val     Glu     #       25405     -      Gln Ile Pro Pro His Ile Lys Arg - # Ile His Arg Trp Arg Tyr Leu     Leu     #  25605     -      Gln Thr Met Arg Ile Leu Gln Phe - # Ser Cys Gly Asn Glu Ser Pro     Phe     # 25750     -      Tyr Leu Gln Pro Lys Arg Leu Val - # Ser Gln Gly Lys Asn Ala Trp     Trp     #               25900 - #                2585     -      Gly Glu Arg Ser Leu Phe Cys Gln - # Ala Trp Lys Asn Leu Lys Arg     Phe     #           26050     -      Glu Lys Ile Phe Arg Tyr Asp Ile - # Arg Pro Ile Thr Lys Arg Phe     Val     #       26205     -      Asn Gln Arg Pro Arg Gln Arg Ile - # Tyr Phe His Ala Cys Leu Phe     Ala     #  26405     -      Lys Leu Thr Gly Phe Lys Gly Gln - # Thr Gly Phe His Leu Lys Asn     Tyr     # 26550     -      Tyr Trp Arg Leu Arg Thr Gly His - # Asp Phe Arg Leu Leu Glu Arg     Ile     #               26700 - #                2665     -      Ser Gly Trp Tyr Tyr Ala Arg Gln - # Leu Trp Phe Phe Ser Ala Val     Cys     #           26850     -      Leu Lys Asn Glu Thr Lys Arg Gly - # Met Arg Leu Ser Arg Arg Phe     Phe     #       27005     -      Ser Gln Arg Ser Asp Phe Leu Tyr - # Gly Lys Leu Leu Phe Cys Gln     Thr     #  27205     -      Leu Leu Phe Tyr His Ala Ser Phe - # Tyr Phe Cys Asn Phe Phe Cys     Phe     # 27350     -      His Arg Tyr Glu Gly Tyr Glu Phe - # Trp Gln Ser Arg Glu Arg Leu     Gly     #               27500 - #                2745     -      Gln Cys Asp Asp Ala Lys Lys Arg - # Asp Gln Phe Ala Phe Leu Ser     Glu     #           27650     -      Leu Pro His Ala His Phe Trp Phe - # Ser Phe Ser Arg Thr His Phe     Asp     #       27805     -      Thr Trp Leu Asn Arg Phe Arg Asp - # Ser Val Tyr Val Asn Asp Leu     Glu     #  28005     -      Arg Tyr Gly Thr Ile Cys Leu Phe - # Ser Cys Cys Gln Gln Arg Gln     Ala     # 28150     -      Lys Ala His Tyr Gly Leu Tyr Gly - # Arg Leu Trp Arg Leu Trp Glu     Tyr     #               28300 - #                2825     -      Cys Arg Asn Gly Trp Gly Val Cys - # Gln Asp Ala Arg Asp Arg Tyr     Arg     #           28450     -      Ile Lys Ser Phe Cys Leu Gly Gly - # Leu Cys Ala Lys Asn Lys Cys     Asn     #       28605     -      Asn Gly Ala Lys Thr Asn Ser Tyr - # Phe Cys Cys Gln Arg Ala Cys     Glu     #  28805     -      Lys Leu Ser Asn Lys Arg Lys Arg - # Lys Trp Gln Gln His Leu Phe     Pro     # 28950     -      Ser Arg Phe Tyr Ala Cys Ser Ala - # Val Thr Asn Leu Ile Thr Gly     Asn     #               29100 - #                2905     -      Met Asn Leu Asp Tyr Pro Ile Thr - # Gln Leu Ile Asn Ala Phe Gly     Lys     #           29250     -      Asp His Asn Asp Pro Asn Gly Leu - # Val Ala Arg Leu Ala Pro Phe     Cys     #       29405     -      Lys Ser Thr Asn Gly Glu Phe Gln - # Trp Leu Phe Asp Asn Lys Ala     Thr     #  29605     -      Asp Arg Leu Asp Phe Ser Lys Thr - # Ile Ile Gly Val Asp Gly Ser     Ser     # 29750     -      Phe Leu Asp Asn Asn Asp Val Ser - # Pro Phe Ile Cys Phe Tyr Leu     Phe     #               29900 - #                2985     -      Ala Arg Ile Gln Glu Ala Met Asp - # Gly Arg Arg Phe Val Leu Asp     Ile     #           30050     -      Asp Glu Ala Trp Lys Tyr Leu Ala - # Ile Gln Arg Ser Leu Ile Leu     Glu     #       30205     -      Thr Cys Lys Leu Gln Gly Lys Glu - # Thr Leu Leu Ser Asp Leu Arg     Leu     #  30405     -      Lys Ala Ser Leu Ile Phe Trp Leu - # Ala Leu Leu Leu Ile Arg Leu     Glu     # 30550     -      Asn Asn Ala Leu Gln Arg Phe Phe - # Glu Thr Met Gly Ala Ile Phe     Leu     #               30700 - #                3065     -      Ile Thr Lys Asp Trp Leu Met Leu - # Gln Lys Lys Asn Leu Lys Ser     Ser     #           30850     -      Leu Arg Asp Ile Gly Lys Ser Ser - # Thr Asn Arg Met Glu Ala Leu     Ala     #       31005     -      Leu Ser Leu Val Leu Ile Glu Ala - # Phe Leu Lys Asn Ile Lys Phe     Tyr     #  31205     -      Pro Gln Ile Leu Tyr Leu Ser Lys - # Lys Leu Ile Arg Leu Ser Lys     Thr     # 31350     -      Ile Val Ser Ile Asn Ile Arg Pro - # Asp Lys Cys Ile Asn Lys Arg     Ser     #               31500 - #                3145     -      Ile Lys Met Lys Gln Asn Leu Arg - # Glu Gln Lys Leu Trp Lys Phe     Lys     #           31650     -      Met Met Ser Arg Phe Trp Ile Val - # Phe Leu Ile Ile Phe Leu Asn     Lys     #       31805     -      Lys Asn Trp Thr Ser Lys Lys Lys - # Trp Lys Ala Lys Ser Leu Asn     Lys     #  32005     - (2) INFORMATION FOR SEQ ID NO:9:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 5599 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     #ID NO:9: (xi) SEQUENCE DESCRIPTION: SEQ     - AAAAATCAAA GAGCTTATGG ATCATAGAGC TAAAGTTCTT TCAGACTTAG AA - #AACAAATA       60     - CAAAAAAGAA AAAGAGGCTC TAGAGAAAGA GACAAGAGGT AAAATCCTTA CT - #GCTAAGTC      120     - AAAGGCTTAT GGGGATCTAG AGCAAGCCTT AAAAGATAAC CCTCTCTATA AG - #AAACTTCT      180     - TCCTAACCCT TATGCCTATG TTTTAAACCA AGAAACATTC ACCAAAGAAG AT - #AAGGAGCG      240     - TTTGAGTTAT TACTACCCCC AGGTGAAAAC GAGCAGTATT TTTAAAAAAA CT - #ACCGCTAC      300     - CACTAAAGAT AAGGCTCAGG CTTTGCTTCA AATGGGTGTG TTTTCTTTAG AT - #GAAGAACA      360     - AAACAAAAAA GCGAGCCGAT TAGCTTTATC TTACAAGCAA GCGATTGAAG AA - #TATTCCAA      420     - TAACATTTCT AATCTGTTGA GCAGAAAAGA ATTGGATAAT ATAGATTATT AC - #TTACAGCT      480     - TGAAAGAAAC AAGTTTGACT CCAAAGCAAA AGATATTGCT CAAAAGGCTA CT - #AACACGCT      540     - TATTTTTAAC TCGGAACGCT TGGCGTTTAG CATGGCGATT GATAAGATTA AT - #GAGAAATA      600     - CTTAAAGGGC TATGAGGCTT TTTCTAACTT GTTGAAAAAT GTCAAAGATG AT - #GTGGAATT      660     - GAATACTCTG ACTAAAAACT TTACCAATCA AAAATTGAGT TTCGCACAAA AA - #CAAAAATT      720     - GTGTTTGTTG GTTTTAGACA GCTTCAATTT TGATACCCAA TCCAAAAAAT CT - #ATATTAAA      780     - AAAGACTAAT GAATACAATA TTTTCGTAGA TAGCGATCCT ATGATGAGCG AC - #AAAACAAC      840     - TATGCAAAAA GAACACTACA AGATATTTAA TTTCTTCAAA ACAGTGGTTT CT - #GCATACCG      900     - AAACAATGTT GCCAAGAATA ATCCCTTTGA ATAGGAAAGG AGACACTCTT GA - #AAAGCATC      960     - TTCAAAAAAC TAGGTTCTGT CGCTCTTTAT TCTTTAGTTG TTTATGGGGG CT - #TAAACGCT     1020     - ATCAATACAG CATTATTGCC GAGTGAATAC AAAGAATTAG TGGCTTTGGG CT - #TTAAAAAA     1080     - ATCAAAACAC TCTATCAAAG ACATGATGAC AAAGAAATTA CAAAAGAGGA AA - #AAGAATTC     1140     - GCCACTAACG CTTTGAGAGA AAAATTACGA AATGATAGGG CGAGAGCAGA GC - #AAATTCAA     1200     - AAGAATATTG AAGCGTTTGA AAAAAAGAAC AACTCTTCTG TTCAAAAAAA AG - #CGGCTAAG     1260     - CACAAAGGAT TACAAGAATT AAACGAAATT AACGCTAACC CTTTGAATGA CA - #ACCCTAAT     1320     - GGCAATTCTT CCACTGAAAC CAAATCTAAT AAAGATGATA ACTTTGATGA GA - #TGATCAAT     1380     - AAGGTGAATG AATCTTTTGT GAAACCTGCT GCTCCGCTTG TGCCTGATGA GT - #GGAGAACG     1440     - CCTGAAATTG AAATCATTAT CAATGAGTGT ATTATTTCAA GCAACGATTA TG - #ATGGGTTA     1500     - AGAAAGTGTT TGATCAAAGA CATCAAGGAT CAAAAAATTC TTGCCCCCTT AT - #TAGAAAAA     1560     - ATTCAAGAAA TAGAGACAGA AAATAACAAG TTTTCTAGAC AACACCTAAG TG - #GTTTAAAA     1620     - CTCACTCTTA ATAACAGCAA CAATAGAACC TTTCTTATAG CTTCGTGCGC TA - #TTTGTGAG     1680     - AAGAGAAAAA AAGAAATGGA GCAAGAAAAT AACTACCAGG ATACTACAAA TG - #CAAGCGAG     1740     - TTTGGCACTA CTGATACAAA AGAAAATGAA GCAAAAGATA CAGCATTCTC AA - #ACAATCGC     1800     - TCTAAATCCG AACTGCCCAA TAGCGTCATT AATCAAATAG AACAAAGCAT CG - #CTCATGGA     1860     - AAAAAATAGC GATCCAAATT ATTAGATCAA AAAACAACTA GAGAAGCAAA TC - #CCAAAGGT     1920     - TAGAAATCAT AGCCTATCGT CTCAGAAAAA TCATTTAACA ATGATCTTAC TT - #GATTGCCT     1980     - TTCTTGTAGG TATTGTCGCT TACTTTGTTC TAGGGATCTT TCTAATGCGT CC - #AACTCCTC     2040     - TAAATAATTT AAAAAGACCT TGTTTTGAGC TAACATAAGC TTTCTGATTC CT - #TTGATGAA     2100     - ATTTTTATTC TTTAGGCTTT CTACAAGCGT CTGTGAAGCA GTGATTAAAG AA - #GCTGTACC     2160     - TCCAATGTTG CTCTGATACG CCTTTAGGGA AGTTTCTAAA CGCTCTCTTA TA - #TTTTGTTT     2220     - TTCTTGCTCG ATTTTCAGCT TCCCTTCACA ATAAAGAACT AAAACTTTAT CG - #GATATTCC     2280     - GCATTGCTGC TCAGCAGTAT TTTGGTCTAA GGGATTGATT TTCATATAGG TT - #AATAAAAG     2340     - TTCAGGGCTA GACATATAAG TCTTGAAAAT CACATCTTCT GAGATGAAAA AT - #AACTCATT     2400     - CGCTTCAAAA TTGGCTTTCA ATAACGCTAA ATCTCCTCTC AAAGCAATGG CC - #GCTTTTTT     2460     - GATGTTTAGA GCATCTTCTT GACCTATTTC ATTATTAGCG CTAGGGCTAG TG - #GTTGAAAA     2520     - AATCTCATCT AAGTTTTTAA GCACTTGTTG GTTGGTCTCT TGGTAGGTGC TA - #TCAAGTTG     2580     - CTTTAAACCG CTTGTTATAT CTTCTGCCAT CAAAACAGAC AATAGCAAAA AA - #GAAGATAT     2640     - GGTATTTTTC ACGAGTGTTT TCATTTGACA ATAACTTTAG AGCTAGCAAT GT - #TTCTTGCT     2700     - GTCGTTTCTC TTTCTAATTT CAGTTGTTCT TCCCAAAGGT CGGCTTTTTT TT - #CAAGATTC     2760     - TCTATATAGT TTAAATGATT TTCTGCGTTT AAGATCGCAA CTTCTATGAG CG - #CATTCAAA     2820     - TCTACTGATC CTTTTAAGGT TTTGATTTCT CCATTGATCC CATTCAAATA AG - #CGATATTT     2880     - TGAAAATCTG CATCACTCAG TTTATTTTGA ATAAGGGCTA CAATCATTCT GT - #AATTCTGA     2940     - ATAACCTGTT CCATAAGGCA TGCTGAAATT TTTAGCCCAT CAAGATAAGG GC - #ATTTTGTG     3000     - GGCGCTAGAG TGAATGTTTC AATGATTCCA AATGGTCGCC CATGCTTGAA AA - #AAAACTAA     3060     - GAGCAGGCGC ATAGATGGCA CTTTGAAACA AAGCCTGACC TGTTAGGGAA TT - #ATAATCAA     3120     - TAAGGGTCGC TTTTTGCATA GCTGTTTTCA ACCATGTCTC AAAACCTTTT AA - #GGTTTCTT     3180     - CAAACGCCTT GATACCAATC GTATTGTAAG CGATGTATTG AGCGTTGTCA GA - #AGAACTTC     3240     - CTAGAGCTTG AGAAATTTCC ATTTGTGTTT TTAGGGTAAC CCTCGGTTCA AA - #GCTGTTTT     3300     - TTAACGCTTC TAAGAGAGCG TTTTGCTGGT TCATTTTGAG CTTGATCATT TC - #GTTATTTT     3360     - TTTGGAGCGC GATTTGCATG TTTTGGATTT CTGTTTGGGT ATTAATTTTT TG - #TTTTTCCA     3420     - CGATCATTTT GACATTCCCC CCCAATGCAC TAAGCGCCGC TTGAATACCC TT - #CCATGACG     3480     - CCAAGCAAGA TGTCTGAACC TGCAAAAAAC CCCCCTGTCA TGCCATTGAC AC - #CATTAATA     3540     - ACGCCATTAG CCCCTTTTAA CATAGCGCTC ATGGTTGCAA GCTGAGTCCT CA - #ATTCTCCC     3600     - TCTATTTGCG CTTGAATGGC TTTTTCTTTG GCACTAGATT GAGCTTCTAT GG - #CTTTTAAT     3660     - TCGGCTTGAG CGGTTTTTTG TTTGGCTTGT GCGTCTGCCT GAATGGCTTT TA - #AGGCAGGT     3720     - TCAAGCGTTA TTACTACCTC TGTACCATTC AGAGACAAAC CACAAAAAGT CA - #AGAAAGAA     3780     - AATATGCTTA AAAAACATTT CACATCTCTT TCCTCACTTC ACGATTATTT TA - #GTTTGCAC     3840     - CCTTTCTGTT AAGTAGCTAT CTTTTTGCCC CTTAAGCTTG TCTTTGATGT AA - #TCAAGGTA     3900     - AGTCAAATGC GATTTCAAAA AAGATTTATT CGCTACTATA TTGTAATTAT AT - #AGCGAACT     3960     - TATGTTAGAA ATCGCTTGAG TGTCATAGGT GCTAGTAGCT AATCCTGATT GA - #TTAAGTAT     4020     - CATTTGAGAA GCGTTCTGCA ACAAATTGGT ATTATTTTTC ACAAATTCTA TA - #TAGTATTC     4080     - TCTCAAAATT TCTGCTACTT TTTCAGCATA GCAATAAACA GCAAGAACCT TG - #TCCCCAAT     4140     - AGGGCATGCA GGAGTGGTCA TAGGATTAAC GCCTGAAGTT AGGGCATTAG TG - #GCTAACGC     4200     - TTGGTATTTA GCATAAACAG TGGGCATAGA AACGCTCATG GGGCGTCATA GA - #AATTTGCA     4260     - TGCAACTGAA AAACACTTTT GATGAGCCAA CAAGCGCACC TAAAGCGGTA CA - #GCTATCAA     4320     - GGAATCGGTG TATCATTCAT TGAGCTGTTG CTTGCTTGAG AACGCAGTTG CT - #CTTGTAGA     4380     - GCTAGGGCGT ATTTTGGTGC TGCACTTGTA ATATTGCCTA ATATACCGTC AT - #CATTTCAA     4440     - CCGTTGTTGG CACGCTAGGA ACAGCGATTT GATTTGTCGC ATAAGCTTCA AT - #AGCACTGG     4500     - GATTTTTAGG GGTGGTGTTA CTCGCTAAAA TGCTTGCAAT CTGACTATTA AC - #AGCACCAA     4560     - TTTGCGCGCC TTGCGTGTTG CCTTGTGCGT TAAATTCCCC TGTTAATTTG CT - #AATATTTA     4620     - AGATATTGTT CCCCACAGCC ATGCTTTGAT CGTTAAAACC TTGAAACAAT TG - #GTTGTATT     4680     - GTTGGTTAGC GGCTTTCATA GGCATGCTTA CGGCTTCAGC GATGCTTTGA TT - #GTATTGGG     4740     - TCATGATAGC GGTCATTTGC GGATTAGTAA ACCCAACAAT AATAGGAATA AT - #CGCTGCTG     4800     - TCATAGCACC CGCTACTATT CCTGCAAATG GTCCTGCGAC ACCACTTGTG TT - #GAGATGAT     4860     - TGAGGAAACT TCCGATAAGA AGCCTGCAGA AGATGATTCA TATATAGCTT GT - #GTACCTGC     4920     - CATGTTAACA CCCCCTAGTT AATACCCTAA TATCGGTGGT AAAAACGATG AA - #TCTGAGTA     4980     - TGTTGGTGCA TAACCATACA TGAAAGGATT GTTTGGACCG TAATCGCCCA TC - #ATTTGGCT     5040     - CATGAGAAGA TTTTGAATGC CCCACATCGC ATTGATACCT AGATTATCAT TA - #GGTTGAAA     5100     - ACTCCCTAAA CTTATGTCGT CAAATTTGAT ATTAACATTT TTATCATTAT AG - #TCATTGAG     5160     - TATGGCCACT TTTTGCTCTA GGGTTTCTTT AGGGATCTCT ATTTTTAGTT GA - #TCTCTAGA     5220     - AACAAGCCCC ACGCTATTTA GTGCCATATC TTCAGGACTA ATATCTTTTA TA - #TCAGTGTT     5280     - TTGGTCAGCG TTAACGGACT GTAAACATGC CAATGATAAG ACACCAAGCA AA - #TAGTAATT     5340     - TAATTTTATA AAAATCCGTT TTCATACTTT TGACTCCTTT ATTCTTATTT TT - #AGCACTAT     5400     - TCTAGCGCAT TAACGCCACT CAATCGTTAT TTTTGTTTTG ATTTTTTTGA TC - #GAGCATTT     5460     - TGTTTGTTAC TTCATCAATG TTTTGAAAAT ATTTTTCAAA AAGCTCTTTC TT - #TTTAGCTT     5520     - CAACGCTCAT ATCAATCTGA ATCCAATTAG GAATAATGGA GTCCATGATT AA - #ATGCATGA     5580     #                 559 - #9     - (2) INFORMATION FOR SEQ ID NO:10:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 1727 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:10:(xi) SEQUENCE DESCRIPTION: SEQ     -      Lys Asn Gln Arg Ala Tyr Gly Ser - # Ser Ser Ser Phe Arg Leu Arg     Lys     #   15     -      Gln Ile Gln Lys Arg Lys Arg Gly - # Ser Arg Glu Arg Asp Lys Arg     Asn     #                 30     -      Pro Tyr Cys Val Lys Gly Leu Trp - # Gly Ser Arg Ala Ser Leu Lys     Arg     #             45     -      Pro Ser Leu Glu Thr Ser Ser Pro - # Leu Cys Leu Cys Phe Lys Pro     Arg     #         60     -      Asn Ile His Gln Arg Arg Gly Ala - # Phe Glu Leu Leu Leu Pro Pro     Gly     #     80     -      Glu Asn Glu Gln Tyr Phe Lys Asn - # Tyr Arg Tyr His Arg Gly Ser     Gly     #   95     -      Phe Ala Ser Asn Gly Cys Val Phe - # Phe Arg Arg Thr Lys Gln Lys     Ser     #                110     -      Glu Pro Ile Ser Phe Ile Leu Gln - # Ala Ser Asp Arg Ile Phe Gln     His     #            125     -      Phe Ser Val Glu Gln Lys Arg Ile - # Gly Tyr Arg Leu Leu Leu Thr     Ala     #        140     -      Lys Lys Gln Val Leu Gln Ser Lys - # Arg Tyr Cys Ser Lys Gly Tyr     His     #    160     -      Ala Tyr Phe Leu Gly Thr Leu Gly - # Val His Gly Asp Asp Glu Ile     Leu     #   175     -      Lys Gly Leu Gly Phe Phe Leu Val - # Glu Lys Cys Gln Arg Cys Gly     Ile     #                190     -      Glu Tyr Ser Asp Lys Leu Tyr Gln - # Ser Lys Ile Glu Phe Arg Thr     Lys     #            205     -      Thr Lys Ile Val Phe Val Gly Phe - # Arg Gln Leu Gln Phe Tyr Pro     Ile     #        220     -      Gln Lys Ile Tyr Ile Lys Lys Asp - # Ile Gln Tyr Phe Arg Arg Arg     Ser     #    240     -      Tyr Asp Glu Arg Gln Asn Asn Tyr - # Ala Lys Arg Thr Leu Gln Asp     Ile     #   255     -      Phe Leu Gln Asn Ser Gly Phe Cys - # Ile Pro Lys Gln Cys Cys Gln     Glu     #                270     -      Ser Leu Ile Gly Lys Glu Thr Leu - # Leu Lys Ser Ile Phe Lys Lys     Leu     #            285     -      Gly Ser Val Ala Leu Tyr Ser Leu - # Val Val Tyr Gly Gly Leu Asn     Ala     #        300     -      Ile Asn Thr Ala Leu Leu Pro Ser - # Glu Tyr Lys Glu Leu Val Ala     Leu     #    320     -      Gly Phe Lys Lys Ile Lys Thr Leu - # Tyr Gln Arg His Asp Asp Lys     Glu     #   335     -      Ile Thr Lys Glu Glu Lys Glu Phe - # Ala Thr Asn Ala Leu Arg Glu     Lys     #                350     -      Leu Arg Asn Asp Arg Ala Arg Ala - # Glu Gln Ile Gln Lys Asn Ile     Glu     #            365     -      Ala Phe Glu Lys Lys Asn Asn Ser - # Ser Val Gln Lys Lys Ala Ala     Lys     #        380     -      His Lys Gly Leu Gln Glu Leu Asn - # Glu Ile Asn Ala Asn Pro Leu     Asn     #    400     -      Asp Asn Pro Asn Gly Asn Ser Ser - # Thr Glu Thr Lys Ser Asn Lys     Asp     #   415     -      Asp Asn Phe Asp Glu Met Ile Asn - # Lys Val Asn Glu Ser Phe Val     Lys     #                430     -      Pro Ala Ala Pro Leu Val Pro Asp - # Glu Trp Arg Thr Pro Glu Ile     Glu     #            445     -      Ile Ile Ile Asn Glu Cys Ile Ile - # Ser Ser Asn Asp Tyr Asp Gly     Leu     #        460     -      Arg Lys Cys Leu Ile Lys Asp Ile - # Lys Asp Gln Lys Ile Leu Ala     Pro     #    480     -      Leu Leu Glu Lys Ile Gln Glu Ile - # Glu Thr Glu Asn Asn Lys Phe     Ser     #   495     -      Arg Gln His Leu Ser Gly Leu Lys - # Leu Thr Leu Asn Asn Ser Asn     Asn     #                510     -      Arg Thr Phe Leu Ile Ala Ser Cys - # Ala Ile Cys Glu Lys Arg Lys     Lys     #            525     -      Glu Met Glu Gln Glu Asn Asn Tyr - # Gln Asp Thr Thr Asn Ala Ser     Glu     #        540     -      Phe Gly Thr Thr Asp Thr Lys Glu - # Asn Glu Ala Lys Asp Thr Ala     Phe     #    560     -      Ser Asn Asn Arg Ser Lys Ser Glu - # Leu Pro Asn Ser Val Ile Asn     Gln     #   575     -      Ile Glu Gln Ser Ile Ala His Gly - # Lys Lys Arg Ser Lys Leu Leu     Asp     #                590     -      Gln Lys Thr Thr Arg Glu Ala Asn - # Pro Lys Gly Lys Ser Pro Ile     Val     #            605     -      Ser Glu Lys Ser Phe Asn Asn Asp - # Leu Thr Leu Pro Phe Leu Val     Leu     #        620     -      Ser Leu Thr Leu Phe Gly Ser Phe - # Cys Val Gln Leu Leu Ile Ile     Lys     #    640     -      Asp Leu Val Leu Ser His Lys Leu - # Ser Asp Ser Phe Asp Glu Ile     Phe     #   655     -      Ile Leu Ala Phe Tyr Lys Arg Leu - # Ser Ser Asp Arg Ser Cys Thr     Ser     #                670     -      Asn Val Ala Leu Ile Arg Leu Gly - # Ser Phe Thr Leu Ser Tyr Ile     Leu     #            685     -      Phe Phe Leu Leu Asp Phe Gln Leu - # Pro Phe Thr Ile Lys Asn Asn     Phe     #        700     -      Ile Gly Tyr Ser Ala Leu Leu Leu - # Ser Ser Ile Leu Val Gly Ile     Asp     #    720     -      Phe His Ile Gly Lys Phe Arg Ala - # Arg His Ile Ser Leu Glu Asn     His     #   735     -      Ile Phe Asp Glu Lys Leu Ile Arg - # Phe Lys Ile Gly Phe Gln Arg     Ile     #                750     -      Ser Ser Gln Ser Asn Gly Arg Phe - # Phe Asp Val Ser Ile Phe Leu     Thr     #            765     -      Tyr Phe Ile Ile Ser Ala Arg Ala - # Ser Gly Lys Asn Leu Ile Val     Phe     #        780     -      Lys His Leu Leu Val Gly Leu Leu - # Val Gly Ala Ile Lys Leu Leu     Thr     #    800     -      Ala Cys Tyr Ile Phe Cys His Gln - # Asn Arg Gln Gln Lys Arg Arg     Tyr     #   815     -      Gly Ile Phe His Glu Cys Phe His - # Leu Thr Ile Thr Leu Glu Leu     Ala     #                830     -      Met Phe Leu Ala Val Val Ser Leu - # Ser Asn Phe Ser Cys Ser Ser     Gln     #            845     -      Arg Ser Ala Phe Phe Ser Arg Phe - # Ser Ile Phe Lys Phe Ser Ala     Phe     #        860     -      Lys Ile Ala Thr Ser Met Ser Ala - # Phe Lys Ser Thr Asp Pro Phe     Lys     #    880     -      Val Leu Ile Ser Pro Leu Ile Pro - # Phe Lys Ala Ile Phe Lys Ser     Ala     #   895     -      Ser Leu Ser Leu Phe Ile Arg Ala - # Thr Ile Ile Leu Phe Ile Thr     Cys     #                910     -      Ser Ile Arg His Ala Glu Ile Phe - # Ser Pro Ser Arg Gly His Phe     Val     #            925     -      Gly Ala Arg Val Asn Val Ser Met - # Ile Pro Asn Gly Arg Pro Cys     Leu     #        940     -      Lys Lys Asn Glu Gln Ala His Arg - # Trp His Phe Glu Thr Lys Pro     Asp     #    960     -      Leu Leu Gly Asn Tyr Asn Gln Gly - # Ser Leu Phe Ala Leu Phe Ser     Thr     #   975     -      Met Ser Gln Asn Leu Leu Arg Phe - # Leu Gln Thr Pro Tyr Gln Ser     Tyr     #                990     -      Cys Lys Arg Cys Ile Glu Arg Cys - # Gln Lys Asn Phe Leu Glu Leu     Glu     #           10050     -      Lys Phe Pro Phe Val Phe Leu Gly - # Pro Ser Val Gln Ser Cys Phe     Leu     #       10205     -      Thr Leu Leu Arg Glu Arg Phe Ala - # Gly Ser Phe Ala Ser Phe Arg     Tyr     #  10405     -      Phe Phe Gly Ala Arg Phe Ala Cys - # Phe Gly Phe Leu Phe Gly Tyr     Phe     # 10550     -      Phe Val Phe Pro Arg Ser Phe His - # Ser Pro Pro Met His Ala Pro     Leu     #               10700 - #                1065     -      Glu Tyr Pro Ser Met Thr Pro Ser - # Lys Met Ser Glu Pro Ala Lys     Asn     #           10850     -      Pro Pro Val Met Pro Leu Thr Pro - # Leu Ile Thr Pro Leu Ala Pro     Phe     #       11005     -      Asn Ile Ala Leu Met Val Ala Ser - # Val Leu Asn Ser Pro Ser Ile     Cys     #  11205     -      Ala Met Ala Phe Ser Leu Ala Leu - # Asp Ala Ser Met Ala Phe Asn     Ser     # 11350     -      Ala Ala Val Phe Cys Leu Ala Cys - # Ala Ser Ala Met Ala Phe Lys     Ala     #               11500 - #                1145     -      Gly Ser Ser Val Ile Thr Thr Ser - # Val Pro Phe Arg Asp Lys Pro     Gln     #           11650     -      Lys Val Lys Lys Glu Asn Met Leu - # Lys Lys His Phe Thr Ser Leu     Ser     #       11805     -      Ser Leu His Asp Tyr Phe Ser Leu - # His Pro Phe Cys Val Ala Ile     Phe     #  12005     -      Leu Pro Leu Lys Leu Val Phe Asp - # Val Ile Lys Val Ser Gln Met     Arg     # 12150     -      Phe Gln Lys Arg Phe Ile Arg Tyr - # Tyr Ile Val Ile Ile Arg Thr     Tyr     #               12300 - #                1225     -      Val Arg Asn Arg Leu Ser Val Ile - # Gly Ala Ser Ser Ser Leu Ile     Lys     #           12450     -      Tyr His Leu Arg Ser Val Leu Gln - # Gln Ile Gly Ile Ile Phe His     Lys     #       12605     -      Phe Tyr Ile Val Phe Ser Gln Asn - # Phe Cys Tyr Phe Phe Ser Ile     Ala     #  12805     -      Ile Asn Ser Lys Asn Leu Val Pro - # Asn Arg Ala Cys Arg Ser Gly     His     # 12950     -      Arg Ile Asn Ala Ser Gly Ile Ser - # Gly Arg Leu Val Phe Ser Ile     Asn     #               13100 - #                1305     -      Ser Gly His Arg Asn Ala His Gly - # Ala Ser Lys Phe Ala Cys Asn     Lys     #           13250     -      Thr Leu Leu Met Ser Gln Gln Ala - # His Leu Lys Arg Tyr Ser Tyr     Gln     #       13405     -      Gly Ile Gly Val Ser Phe Ile Glu - # Leu Leu Leu Ala Glu Arg Ser     Cys     #  13605     -      Ser Cys Arg Ala Arg Ala Tyr Phe - # Gly Ala Ala Leu Val Ile Leu     Pro     # 13750     -      Asn Ile Pro Ser Ser Phe Gln Pro - # Leu Leu Ala Arg Glu Gln Arg     Phe     #               13900 - #                1385     -      Asp Leu Ser His Lys Leu Gln His - # Trp Asp Phe Gly Trp Cys Tyr     Ser     #           14050     -      Leu Lys Cys Leu Gln Ser Asp Tyr - # Gln His Gln Phe Ala Arg Leu     Ala     #       14205     -      Cys Cys Leu Val Arg Ile Pro Leu - # Leu Ile Cys Tyr Leu Arg Tyr     Cys     #  14405     -      Ser Pro Gln Pro Cys Phe Asp Arg - # Asn Leu Glu Thr Ile Gly Cys     Ile     # 14550     -      Val Gly Arg Leu Ser Ala Cys Leu - # Arg Leu Gln Arg Cys Phe Asp     Cys     #               14700 - #                1465     -      Ile Gly Ser Arg Ser Phe Ala Asp - # Thr Gln Gln Glu Ser Leu Leu     Ser     #           14850     -      His Pro Leu Leu Phe Leu Gln Met - # Val Leu Arg His His Leu Cys     Asp     #       15005     -      Asp Gly Asn Phe Arg Glu Ala Cys - # Arg Arg Phe Ile Tyr Ser Leu     Cys     #  15205     -      Thr Cys His Val Asn Thr Pro Leu - # Ile Pro Tyr Arg Trp Lys Arg     Ile     # 15350     -      Val Cys Trp Cys Ile Thr Ile His - # Glu Arg Ile Val Trp Thr Val     Ile     #               15500 - #                1545     -      Ala His His Leu Ala His Glu Lys - # Ile Leu Asn Ala Pro His Arg     Ile     #           15650     -      Asp Thr Ile Ile Ile Arg Leu Lys - # Thr Pro Thr Tyr Val Val Lys     Phe     #       15805     -      Asp Ile Asn Ile Phe Ile Ile Ile - # Val Ile Glu Tyr Gly His Phe     Leu     #  16005     -      Leu Gly Phe Phe Arg Asp Leu Tyr - # Phe Leu Ile Ser Arg Asn Lys     Pro     # 16150     -      His Ala Ile Cys His Ile Phe Arg - # Thr Asn Ile Phe Tyr Ile Ser     Val     #               16300 - #                1625     -      Leu Val Ser Val Asn Gly Leu Thr - # Cys Gln Asp Thr Lys Gln Ile     Val     #           16450     -      Ile Phe Tyr Lys Asn Pro Phe Ser - # Tyr Phe Leu Leu Tyr Ser Tyr     Phe     #       16605     -      His Tyr Ser Ser Ala Leu Thr Pro - # Leu Asn Arg Tyr Phe Cys Phe     Asp     #  16805     -      Phe Phe Asp Arg Ala Phe Cys Leu - # Leu Leu His Gln Cys Phe Glu     Asn     # 16950     -      Ile Phe Gln Lys Ala Leu Ser Phe - # Leu Gln Arg Ser Tyr Gln Ser     Glu     #               17100 - #                1705     -      Ser Asn Glu Trp Ser Pro Leu Asn - # Ala Ser His Arg His Asp Phe     #           17250     - (2) INFORMATION FOR SEQ ID NO:11:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 1781 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:11:(xi) SEQUENCE DESCRIPTION: SEQ     -      Lys Ile Lys Glu Leu Met Asp His - # Arg Ala Lys Val Leu Ser Asp     Leu     #   15     -      Glu Asn Lys Tyr Lys Lys Glu Lys - # Glu Ala Leu Glu Lys Glu Thr     Arg     #                 30     -      Gly Lys Ile Leu Thr Ala Lys Ser - # Lys Ala Tyr Gly Asp Leu Glu     Gln     #             45     -      Ala Leu Lys Asp Asn Pro Leu Tyr - # Lys Lys Leu Leu Pro Asn Pro     Tyr     #         60     -      Ala Tyr Val Leu Asn Gln Glu Thr - # Phe Thr Lys Glu Asp Lys Glu     Arg     #     80     -      Leu Ser Tyr Tyr Tyr Pro Gln Val - # Lys Thr Ser Ser Ile Phe Lys     Lys     #   95     -      Thr Thr Ala Thr Thr Lys Asp Lys - # Ala Gln Ala Leu Leu Gln Met     Gly     #                110     -      Val Phe Ser Leu Asp Glu Glu Gln - # Asn Lys Lys Ala Ser Arg Leu     Ala     #            125     -      Leu Ser Tyr Lys Gln Ala Ile Glu - # Glu Tyr Ser Asn Asn Ile Ser     Asn     #        140     -      Leu Leu Ser Arg Lys Glu Leu Asp - # Asn Ile Asp Tyr Tyr Leu Gln     Leu     #    160     -      Glu Arg Asn Lys Phe Asp Ser Lys - # Ala Lys Asp Ile Ala Gln Lys     Ala     #   175     -      Thr Asn Thr Leu Ile Phe Asn Ser - # Glu Arg Leu Ala Phe Ser Met     Ala     #                190     -      Ile Asp Lys Ile Asn Glu Lys Tyr - # Leu Lys Gly Tyr Glu Ala Phe     Ser     #            205     -      Asn Leu Leu Lys Asn Val Lys Asp - # Asp Val Glu Leu Asn Thr Leu     Thr     #        220     -      Lys Asn Phe Thr Asn Gln Lys Leu - # Ser Phe Ala Gln Lys Gln Lys     Leu     #    240     -      Cys Leu Leu Val Leu Asp Ser Phe - # Asn Phe Asp Thr Gln Ser Lys     Lys     #   255     -      Ser Ile Leu Lys Lys Thr Asn Glu - # Tyr Asn Ile Phe Val Asp Ser     Asp     #                270     -      Pro Met Met Ser Asp Lys Thr Thr - # Met Gln Lys Glu His Tyr Lys     Ile     #            285     -      Phe Asn Phe Phe Lys Thr Val Val - # Ser Ala Tyr Arg Asn Asn Val     Ala     #        300     -      Lys Asn Asn Pro Phe Glu Glu Arg - # Arg His Ser Lys Ala Ser Ser     Lys     #    320     -      Asn Val Leu Ser Leu Phe Ile Leu - # Leu Phe Met Gly Ala Thr Leu     Ser     #   335     -      Ile Gln His Tyr Cys Arg Val Asn - # Thr Lys Asn Trp Leu Trp Ala     Leu     #                350     -      Lys Lys Ser Lys His Ser Ile Lys - # Asp Met Met Thr Lys Lys Leu     Gln     #            365     -      Lys Arg Lys Lys Asn Ser Pro Leu - # Thr Leu Glu Lys Asn Tyr Glu     Met     #        380     -      Ile Gly Arg Glu Gln Ser Lys Phe - # Lys Arg Ile Leu Lys Arg Leu     Lys     #    400     -      Lys Arg Thr Thr Leu Leu Phe Lys - # Lys Lys Arg Leu Ser Thr Lys     Asp     #   415     -      Tyr Lys Asn Thr Lys Leu Thr Leu - # Thr Leu Met Thr Thr Leu Met     Ala     #                430     -      Ile Leu Pro Leu Lys Pro Asn Leu - # Ile Lys Met Ile Thr Leu Met     Arg     #            445     -      Ser Ile Arg Met Asn Leu Leu Asn - # Leu Leu Leu Arg Leu Cys Leu     Met     #        460     -      Ser Gly Glu Arg Leu Lys Leu Lys - # Ser Leu Ser Met Ser Val Leu     Phe     #    480     -      Gln Ala Thr Ile Met Met Gly Glu - # Ser Val Ser Lys Thr Ser Arg     Ile     #   495     -      Lys Lys Phe Leu Pro Pro Tyr Lys - # Lys Phe Lys Lys Arg Gln Lys     Ile     #                510     -      Thr Ser Phe Leu Asp Asn Thr Val - # Val Asn Ser Leu Leu Ile Thr     Ala     #            525     -      Thr Ile Glu Pro Phe Leu Leu Arg - # Ala Leu Phe Val Arg Arg Glu     Lys     #        540     -      Lys Lys Trp Ser Lys Lys Ile Thr - # Thr Arg Ile Leu Gln Met Gln     Ala     #    560     -      Ser Leu Ala Leu Leu Ile Gln Lys - # Lys Met Lys Gln Lys Ile Gln     His     #   575     -      Ser Gln Thr Ile Ala Leu Asn Pro - # Asn Cys Pro Ile Ala Ser Leu     Ile     #                590     -      Lys Asn Lys Ala Ser Leu Met Glu - # Lys Asn Ser Asp Pro Asn Tyr     Ile     #            605     -      Lys Lys Gln Leu Glu Lys Gln Ile - # Pro Lys Val Arg Asn His Ser     Leu     #        620     -      Ser Ser Gln Lys Asn His Leu Thr - # Met Ile Leu Leu Asp Cys Leu     Ser     #    640     -      Cys Arg Tyr Cys Arg Leu Leu Cys - # Ser Arg Asp Leu Ser Asn Ala     Ser     #   655     -      Asn Ser Ser Lys Phe Lys Lys Thr - # Leu Phe Ala Asn Ile Ser Phe     Leu     #                670     -      Ile Pro Leu Met Lys Phe Leu Phe - # Phe Arg Leu Ser Thr Ser Val     Cys     #            685     -      Glu Ala Val Ile Lys Glu Ala Val - # Pro Pro Met Leu Leu Tyr Ala     Phe     #        700     -      Arg Glu Val Ser Lys Arg Ser Leu - # Ile Phe Cys Phe Ser Cys Ser     Ile     #    720     -      Phe Ser Phe Pro Ser Gln Arg Thr - # Lys Thr Leu Ser Asp Ile Pro     His     #   735     -      Cys Cys Ser Ala Val Phe Trp Ser - # Lys Gly Leu Ile Phe Ile Val     Asn     #                750     -      Lys Ser Ser Gly Leu Asp Ile Val - # Leu Lys Ile Thr Ser Ser Glu     Met     #            765     -      Lys Asn Asn Ser Phe Ala Ser Lys - # Leu Ala Phe Asn Asn Ala Lys     Ser     #        780     -      Pro Leu Lys Ala Met Ala Ala Phe - # Leu Met Phe Arg Ala Ser Ser     Pro     #    800     -      Ile Ser Leu Leu Ala Leu Gly Leu - # Val Val Glu Lys Ile Ser Ser     Lys     #   815     -      Phe Leu Ser Thr Cys Trp Leu Val - # Ser Trp Val Leu Ser Ser Cys     Phe     #                830     -      Lys Pro Leu Val Ile Ser Ser Ala - # Ile Lys Thr Asp Asn Ser Lys     Lys     #            845     -      Glu Asp Met Val Phe Phe Thr Ser - # Val Phe Ile Gln Leu Ser Gln     Cys     #        860     -      Phe Leu Leu Ser Phe Leu Phe Leu - # Ile Ser Val Val Leu Pro Lys     Gly     #    880     -      Arg Leu Phe Phe Gln Asp Ser Leu - # Tyr Ser Leu Asn Asp Phe Leu     Arg     #   895     -      Leu Arg Ser Gln Leu Leu Ala His - # Ser Asn Leu Leu Ile Leu Leu     Arg     #                910     -      Phe Phe Leu His Ser His Ser Asn - # Lys Arg Tyr Phe Glu Asn Leu     His     #            925     -      His Ser Val Tyr Phe Glu Gly Leu - # Gln Ser Phe Cys Asn Ser Glu     Pro     #        940     -      Val Pro Gly Met Leu Lys Phe Leu - # Ala His Gln Asp Lys Gly Ile     Leu     #    960     -      Trp Ala Leu Glu Met Phe Gln Phe - # Gln Met Val Ala His Ala Lys     Lys     #   975     -      Thr Lys Ser Arg Arg Ile Asp Gly - # Thr Leu Lys Gln Ser Leu Thr     Cys     #                990     -      Gly Ile Ile Ile Asn Lys Gly Arg - # Phe Leu His Ser Cys Phe Gln     Pro     #           10050     -      Cys Leu Lys Thr Phe Gly Phe Phe - # Lys Arg Leu Asp Thr Asn Arg     Ile     #       10205     -      Val Ser Asp Val Leu Ser Val Val - # Arg Arg Thr Ser Ser Leu Arg     Asn     #  10405     -      Phe His Leu Cys Phe Gly Asn Pro - # Arg Phe Lys Ala Val Phe Arg     Phe     # 10550     -      Glu Ser Val Leu Leu Val His Phe - # Glu Leu Asp His Phe Val Ile     Phe     #               10700 - #                1065     -      Leu Glu Arg Asp Leu His Val Leu - # Asp Phe Cys Leu Gly Ile Asn     Phe     #           10850     -      Leu Phe Phe His Asp His Phe Asp - # Ile Pro Pro Gln Cys Thr Lys     Arg     #       11005     -      Arg Leu Asn Thr Leu Pro Arg Gln - # Ala Arg Cys Leu Asn Leu Gln     Lys     #  11205     -      Thr Pro Leu Ser Cys His His His - # Arg His Pro Leu Leu Thr Arg     Ser     # 11350     -      Trp Leu Gln Ala Glu Ser Ser Ile - # Leu Pro Leu Phe Ala Leu Glu     Trp     #               11500 - #                1145     -      Leu Phe Leu Trp His Ile Glu Leu - # Leu Trp Leu Leu Ile Arg Leu     Glu     #           11650     -      Arg Phe Phe Val Trp Leu Val Arg - # Leu Pro Glu Trp Leu Leu Arg     Gln     #       11805     -      Val Gln Ala Leu Leu Leu Pro Leu - # Tyr His Ser Glu Thr Asn His     Lys     #  12005     -      Lys Ser Arg Lys Lys Ile Cys Leu - # Lys Asn Ile Ser His Leu Phe     Pro     # 12150     -      His Phe Thr Ile Ile Leu Val Cys - # Thr Leu Ser Val Lys Leu Ser     Phe     #               12300 - #                1225     -      Cys Pro Leu Ser Leu Ser Leu Met - # Ser Arg Val Lys Cys Asp Phe     Lys     #           12450     -      Lys Asp Leu Phe Ala Thr Ile Leu - # Leu Tyr Ser Glu Leu Met Leu     Glu     #       12605     -      Ile Ala Val Ser Val Leu Val Ala - # Asn Pro Asp Leu Ser Ile Ile     Glu     #  12805     -      Ala Phe Cys Asn Lys Leu Val Leu - # Phe Phe Thr Asn Ser Ile Tyr     Ser     # 12950     -      Leu Lys Ile Ser Ala Thr Phe Ser - # Ala Gln Thr Ala Arg Thr Leu     Ser     #               13100 - #                1305     -      Pro Ile Gly His Ala Gly Val Val - # Ile Gly Leu Thr Pro Glu Val     Arg     #           13250     -      Ala Leu Val Ala Asn Ala Trp Tyr - # Leu Ala Thr Val Gly Ile Glu     Thr     #       13405     -      Leu Met Gly Arg His Arg Asn Leu - # His Ala Thr Glu Lys His Phe     Ala     #  13605     -      Asn Lys Arg Thr Ser Gly Thr Ala - # Ile Lys Glu Ser Val Tyr His     Ser     # 13750     -      Leu Ser Cys Cys Leu Leu Glu Asn - # Ala Val Ala Leu Val Glu Leu     Gly     #               13900 - #                1385     -      Arg Ile Leu Val Leu His Leu Tyr - # Cys Leu Ile Tyr Arg His His     Phe     #           14050     -      Asn Arg Cys Trp His Ala Arg Asn - # Ser Asp Leu Ile Cys Arg Ile     Ser     #       14205     -      Phe Asn Ser Thr Gly Ile Phe Arg - # Gly Gly Val Thr Arg Asn Ala     Cys     #  14405     -      Asn Leu Thr Ile Asn Ser Thr Asn - # Leu Arg Ala Leu Arg Val Ala     Leu     # 14550     -      Cys Val Lys Phe Pro Cys Phe Ala - # Asn Ile Asp Ile Val Pro His     Ser     #               14700 - #                1465     -      His Ala Leu Ile Val Lys Thr Leu - # Lys Gln Leu Val Val Leu Leu     Val     #           14850     -      Ser Gly Phe His Arg His Ala Tyr - # Gly Phe Ser Asp Ala Leu Ile     Val     #       15005     -      Leu Gly His Asp Ser Gly His Leu - # Arg Ile Ser Lys Pro Asn Asn     Asn     #  15205     -      Arg Asn Asn Arg Cys Cys His Ser - # Thr Arg Tyr Tyr Ser Cys Lys     Trp     # 15350     -      Ser Cys Asp Thr Thr Cys Val Glu - # Met Ile Glu Glu Thr Ser Asp     Lys     #               15500 - #                1545     -      Lys Pro Ala Glu Asp Asp Ser Tyr - # Ile Ala Cys Val Pro Ala Met     Leu     #           15650     -      Thr Pro Pro Ser Tyr Pro Asn Ile - # Gly Gly Lys Asn Asp Glu Ser     Glu     #       15805     -      Tyr Val Gly Ala Pro Tyr Met Lys - # Gly Leu Phe Gly Pro Ser Pro     Ile     #  16005     -      Ile Trp Leu Met Arg Arg Phe Met - # Pro His Ile Ala Leu Ile Pro     Arg     # 16150     -      Leu Ser Leu Gly Lys Leu Pro Lys - # Leu Met Ser Ser Asn Leu Ile     Leu     #               16300 - #                1625     -      Thr Phe Leu Ser Leu Ser Leu Ser - # Met Ala Thr Phe Cys Ser Arg     Val     #           16450     -      Ser Leu Gly Ile Ser Ile Phe Ser - # Ser Leu Glu Thr Ser Pro Thr     Leu     #       16605     -      Phe Ser Ala Ile Ser Ser Gly Leu - # Ile Ser Phe Ile Ser Val Phe     Trp     #  16805     -      Ser Ala Leu Thr Asp Cys Lys His - # Ala Asn Asp Lys Thr Pro Ser     Lys     # 16950     -      Phe Asn Phe Ile Lys Ile Arg Phe - # His Thr Phe Asp Ser Phe Ile     Leu     #               17100 - #                1705     -      Ile Phe Ser Thr Ile Leu Ala His - # Arg His Ser Ile Val Ile Phe     Val     #           17250     -      Leu Ile Phe Leu Ile Glu His Phe - # Val Cys Tyr Phe Ile Asn Val     Leu     #       17405     -      Lys Ile Phe Phe Lys Lys Leu Phe - # Leu Phe Ser Phe Asn Ala His     Ile     #  17605     -      Asn Leu Asn Pro Ile Arg Asn Asn - # Gly Val His Asp Met His Glu     Val     # 17750     -      Ile Gly Met Ile Phe                      1780     - (2) INFORMATION FOR SEQ ID NO:12:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 1720 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:12:(xi) SEQUENCE DESCRIPTION: SEQ     -      Lys Ser Lys Ser Leu Trp Ile Ile - # Glu Leu Lys Phe Phe Gln Thr     Lys     #   15     -      Thr Asn Thr Lys Lys Lys Lys Arg - # Leu Arg Lys Arg Gln Glu Val     Lys     #                 30     -      Ser Leu Leu Leu Ser Gln Arg Leu - # Met Gly Ile Ser Lys Pro Lys     Ile     #             45     -      Thr Leu Ser Ile Arg Asn Phe Phe - # Leu Thr Leu Met Pro Met Phe     Thr     #         60     -      Lys Lys His Ser Pro Lys Lys Ile - # Arg Ser Val Val Ile Thr Thr     Pro     #     80     -      Arg Lys Arg Ala Val Phe Leu Lys - # Lys Leu Pro Leu Pro Leu Lys     Ile     #   95     -      Arg Leu Arg Leu Cys Phe Lys Trp - # Val Cys Phe Leu Met Lys Asn     Lys     #                110     -      Thr Lys Lys Arg Ala Asp Leu Tyr - # Leu Thr Ser Lys Arg Leu Lys     Asn     #            125     -      Ile Pro Ile Thr Phe Leu Ile Cys - # Ala Glu Lys Asn Trp Ile Ile     Ile     #        140     -      Ile Thr Tyr Ser Leu Lys Glu Thr - # Ser Leu Thr Pro Lys Gln Lys     Ile     #    160     -      Leu Leu Lys Arg Leu Leu Thr Arg - # Leu Phe Leu Thr Arg Asn Ala     Trp     #   175     -      Arg Leu Ala Trp Arg Leu Ile Arg - # Leu Met Arg Asn Thr Arg Ala     Met     #                190     -      Arg Leu Phe Leu Thr Cys Lys Met - # Ser Lys Met Met Trp Asn Ile     Leu     #            205     -      Leu Lys Thr Leu Pro Ile Lys Asn - # Val Ser His Lys Asn Lys Asn     Cys     #        220     -      Val Cys Trp Phe Thr Ala Ser Ile - # Leu Ile Pro Asn Pro Lys Asn     Leu     #    240     -      Tyr Lys Arg Leu Met Asn Thr Ile - # Phe Ser Ile Ala Ile Leu Ala     Thr     #   255     -      Lys Gln Leu Cys Lys Lys Asn Thr - # Thr Arg Tyr Leu Ile Ser Ser     Lys     #                270     -      Gln Trp Phe Leu His Thr Glu Thr - # Met Leu Pro Arg Ile Ile Pro     Leu     #            285     -      Asn Arg Lys Gly Asp Thr Leu Glu - # Lys His Leu Gln Lys Thr Arg     Phe     #        300     -      Cys Arg Ser Leu Phe Phe Ser Cys - # Leu Trp Gly Leu Lys Arg Tyr     Gln     #    320     -      Tyr Ser Ile Ile Ala Glu Ile Gln - # Arg Ile Ser Gly Phe Gly Leu     Lys     #   335     -      Asn Gln Asn Thr Leu Ser Lys Thr - # Gln Arg Asn Tyr Lys Arg Gly     Lys     #                350     -      Arg Ile Arg His Arg Phe Glu Arg - # Lys Ile Thr Lys Gly Glu Ser     Arg     #            365     -      Ala Asn Ser Lys Glu Tyr Ser Val - # Lys Lys Glu Gln Leu Phe Cys     Ser     #        380     -      Lys Lys Ser Gly Ala Gln Arg Ile - # Thr Arg Ile Lys Arg Asn Arg     Pro     #    400     -      Phe Glu Gln Pro Trp Gln Phe Phe - # His Asn Gln Ile Arg Leu Asp     Asp     #   415     -      Gln Gly Glu Ile Phe Cys Glu Thr - # Cys Cys Ser Ala Cys Ala Val     Glu     #                430     -      Asn Ala Asn Asn His Tyr Gln Val - # Tyr Tyr Phe Lys Gln Arg Leu     Trp     #            445     -      Val Lys Lys Val Phe Asp Gln Arg - # His Gln Gly Ser Lys Asn Ser     Cys     #        460     -      Pro Leu Ile Arg Lys Asn Ser Arg - # Asn Arg Asp Arg Lys Gln Val     Phe     #    480     -      Thr Thr Pro Lys Trp Phe Lys Thr - # His Ser Gln Gln Gln Asn Leu     Ser     #   495     -      Tyr Ser Phe Val Arg Tyr Leu Glu - # Glu Lys Lys Arg Asn Gly Ala     Arg     #                510     -      Lys Leu Pro Gly Tyr Tyr Lys Cys - # Lys Arg Val Trp His Tyr Tyr     Lys     #            525     -      Arg Lys Ser Lys Arg Tyr Ser Ile - # Leu Lys Gln Ser Leu Ile Arg     Thr     #        540     -      Ala Gln Arg His Ser Asn Arg Thr - # Lys His Arg Ser Trp Lys Lys     Ile     #    560     -      Ala Ile Gln Ile Ile Arg Ser Lys - # Asn Asn Arg Ser Lys Ser Gln     Arg     #   575     -      Leu Glu Ile Ile Ala Tyr Arg Leu - # Arg Lys Ile Ile Gln Ser Tyr     Leu     #                590     -      Ile Ala Phe Leu Val Gly Ile Val - # Ala Tyr Phe Val Leu Gly Ile     Phe     #            605     -      Leu Met Arg Pro Thr Pro Leu Asn - # Asn Leu Lys Arg Pro Cys Phe     Glu     #        620     -      Leu Thr Ala Phe Phe Leu Asn Phe - # Tyr Ser Leu Gly Phe Leu Gln     Ala     #    640     -      Ser Val Lys Gln Leu Lys Lys Leu - # Tyr Leu Gln Cys Cys Ser Asp     Thr     #   655     -      Pro Leu Gly Lys Phe Leu Asn Ala - # Leu Leu Tyr Phe Val Phe Leu     Ala     #                670     -      Arg Phe Ser Ala Ser Leu His Asn - # Lys Glu Leu Lys Leu Tyr Arg     Ile     #            685     -      Phe Arg Ile Ala Ala Gln Gln Tyr - # Phe Gly Leu Arg Asp Phe Ser     Tyr     #        700     -      Arg Leu Ile Lys Val Gln Gly Thr - # Tyr Lys Ser Lys Ser His Leu     Leu     #    720     -      Arg Lys Ile Thr His Ser Leu Gln - # Asn Trp Leu Ser Ile Thr Leu     Asn     #   735     -      Leu Leu Ser Lys Gln Trp Pro Leu - # Phe Cys Leu Glu His Leu Leu     Asp     #                750     -      Leu Phe His Tyr Arg Gly Trp Leu - # Lys Lys Ser His Leu Ser Phe     Ala     #            765     -      Leu Val Gly Trp Ser Leu Gly Arg - # Cys Tyr Gln Val Ala Leu Asn     Arg     #        780     -      Leu Leu Tyr Leu Leu Pro Ser Lys - # Gln Thr Ile Ala Lys Lys Lys     Ile     #    800     -      Trp Tyr Phe Ser Arg Val Phe Ser - # Phe Asp Asn Asn Phe Arg Ala     Ser     #   815     -      Asn Val Ser Cys Cys Arg Phe Ser - # Phe Phe Gln Leu Phe Phe Pro     Lys     #                830     -      Val Gly Phe Phe Phe Lys Ile Leu - # Tyr Ile Val Met Ile Phe Cys     Val     #            845     -      Asp Arg Asn Phe Tyr Glu Arg Ile - # Gln Ile Tyr Ser Phe Gly Phe     Asp     #        860     -      Phe Ser Ile Asp Pro Ile Gln Ile - # Ser Asp Ile Leu Lys Ile Cys     Ile     #    880     -      Thr Gln Phe Ile Leu Asn Lys Gly - # Tyr Asn His Ser Val Ile Leu     Asn     #   895     -      Asn Leu Phe His Lys Ala Cys Asn - # Phe Pro Ile Lys Ile Arg Ala     Phe     #                910     -      Cys Gly Arg Ser Glu Cys Phe Asn - # Asp Ser Lys Trp Ser Pro Met     Leu     #            925     -      Glu Lys Lys Leu Arg Ala Gly Ala - # Met Ala Leu Asn Lys Ala Pro     Val     #        940     -      Arg Glu Leu Ser Ile Arg Val Ala - # Phe Cys Ile Ala Val Phe Asn     His     #    960     -      Val Ser Lys Pro Phe Lys Val Ser - # Ser Asn Ala Leu Ile Pro Ile     Val     #   975     -      Leu Ala Met Tyr Ala Leu Ser Glu - # Glu Leu Pro Arg Ala Glu Ile     Ser     #                990     -      Ile Cys Val Phe Arg Val Thr Leu - # Gly Ser Lys Leu Phe Phe Asn     Ala     #           10050     -      Ser Lys Arg Ala Phe Cys Trp Phe - # Ile Leu Ser Leu Ile Ile Ser     Leu     #       10205     -      Phe Phe Trp Ser Ala Ile Cys Met - # Phe Trp Ile Ser Val Trp Val     Leu     #  10405     -      Ile Phe Cys Phe Ser Thr Ile Ile - # Leu Thr Phe Pro Pro Asn Ala     Leu     # 10550     -      Ser Ala Ala Ile Pro Phe His Asp - # Ala Lys Gln Asp Val Thr Cys     Lys     #               10700 - #                1065     -      Lys Pro Pro Cys His Ala Ile Asp - # Thr Ile Asn Asn Ala Ile Ser     Pro     #           10850     -      Phe His Ser Ala His Gly Cys Lys - # Leu Ser Pro Gln Phe Ser Leu     Tyr     #       11005     -      Leu Arg Leu Asn Gly Phe Phe Phe - # Gly Thr Arg Leu Ser Phe Tyr     Gly     #  11205     -      Phe Phe Gly Leu Ser Gly Phe Leu - # Phe Gly Leu Cys Val Cys Leu     Asn     # 11350     -      Gly Phe Gly Arg Phe Lys Arg Tyr - # Tyr Tyr Leu Cys Thr Ile Gln     Arg     #               11500 - #                1145     -      Gln Thr Thr Lys Ser Gln Glu Arg - # Lys Tyr Ala Lys Thr Phe His     Ile     #           11650     -      Ser Phe Leu Thr Ser Arg Leu Phe - # Phe Ala Pro Phe Leu Leu Ser     Ser     #       11805     -      Tyr Leu Phe Ala Pro Ala Cys Leu - # Cys Asn Gln Gly Lys Ser Asn     Ala     #  12005     -      Ile Ser Lys Lys Ile Tyr Ser Leu - # Leu Tyr Cys Asn Tyr Ile Ala     Asn     # 12150     -      Leu Cys Lys Ser Leu Glu Cys His - # Arg Cys Leu Ile Leu Ile Asp     Val     #               12300 - #                1225     -      Ser Phe Glu Lys Arg Ser Ala Thr - # Asn Trp Tyr Tyr Phe Ser Gln     Ile     #           12450     -      Leu Tyr Ser Ile Leu Ser Lys Phe - # Leu Leu Leu Phe Gln His Ser     Asn     #       12605     -      Lys Gln Gln Glu Pro Cys Pro Gln - # Gly Met Gln Glu Trp Ser Asp     Arg     #  12805     -      Leu Lys Leu Gly His Trp Leu Thr - # Leu Gly Ile His Lys Gln Trp     Ala     # 12950     -      Lys Arg Ser Trp Gly Val Ile Glu - # Ile Cys Met Gln Leu Lys Asn     Thr     #               13100 - #                1305     -      Phe Asp Glu Pro Thr Ser Ala Pro - # Lys Ala Val Gln Leu Ser Arg     Asn     #           13250     -      Arg Cys Ile Ile His Ala Val Ala - # Cys Leu Arg Thr Gln Leu Leu     Leu     #       13405     -      Ser Gly Val Phe Trp Cys Cys Thr - # Cys Asn Ile Ala Tyr Thr Val     Ile     #  13605     -      Ile Ser Thr Val Val Gly Thr Leu - # Gly Thr Ala Ile Phe Val Ala     Ala     # 13750     -      Ser Ile Ala Leu Gly Phe Leu Gly - # Val Val Leu Leu Ala Lys Met     Leu     #               13900 - #                1385     -      Ala Ile Leu Leu Thr Ala Pro Ile - # Cys Ala Pro Cys Val Leu Pro     Cys     #           14050     -      Ala Leu Asn Ser Pro Val Asn Leu - # Leu Ile Phe Lys Ile Leu Phe     Pro     #       14205     -      Thr Ala Met Leu Ser Leu Lys Pro - # Asn Asn Trp Leu Tyr Cys Trp     Leu     #  14405     -      Ala Ala Phe Ile Gly Met Leu Thr - # Ala Ser Ala Met Leu Leu Tyr     Trp     # 14550     -      Val Met Ile Ala Val Ile Cys Gly - # Leu Val Asn Pro Thr Ile Ile     Gly     #               14700 - #                1465     -      Ile Ile Ala Ala Val Ile Ala Pro - # Ala Thr Ile Pro Ala Asn Gly     Pro     #           14850     -      Ala Thr Pro Leu Val Leu Arg Leu - # Arg Lys Leu Pro Ile Arg Ser     Leu     #       15005     -      Gln Lys Met Ile His Ile Leu Val - # Tyr Leu Pro Cys His Pro Leu     Val     #  15205     -      Asn Thr Leu Ile Ser Val Val Lys - # Thr Met Asn Leu Ser Met Leu     Val     # 15350     -      His Asn His Thr Lys Asp Cys Leu - # Asp Arg Asn Arg Pro Ser Phe     Gly     #               15500 - #                1545     -      Ser Glu Asp Phe Glu Cys Pro Thr - # Ser His Tyr Leu Asp Tyr His     Val     #           15650     -      Glu Asn Ser Leu Asn Leu Cys Arg - # Gln Ile Tyr His Phe Tyr His     Tyr     #       15805     -      Ser His Val Trp Pro Leu Phe Ala - # Leu Gly Phe Leu Gly Ser Leu     Phe     #  16005     -      Leu Val Asp Leu Lys Gln Ala Pro - # Arg Tyr Leu Val Pro Tyr Leu     Gln     # 16150     -      Asp Tyr Leu Leu Tyr Gln Cys Phe - # Gly Gln Arg Arg Thr Val Asn     Met     #               16300 - #                1625     -      Pro Met Ile Arg His Gln Ala Asn - # Ser Asn Leu Ile Leu Lys Ser     Val     #           16450     -      Phe Ile Leu Leu Thr Pro Leu Phe - # Leu Phe Leu Ala Leu Phe Arg     Ile     #       16605     -      Asn Ala Thr Gln Ser Leu Phe Leu - # Phe Phe Phe Ser Ser Ile Leu     Phe     #  16805     -      Val Thr Ser Ser Met Phe Lys Tyr - # Phe Ser Lys Ser Ser Phe Phe     Leu     # 16950     -      Ala Ser Thr Leu Ile Ser Ile Ile - # Gln Leu Gly Ile Met Glu Ser     Met     #               17100 - #                1705     -      Ile Lys Cys Met Lys Ser Ala Phe     #            1720     - (2) INFORMATION FOR SEQ ID NO:13:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 5599 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     #ID NO:13:(xi) SEQUENCE DESCRIPTION: SEQ     - AAAAATCATG CCTATGACTT CATGCATTTA ATCATGGACT CCATTATTCC TA - #ATTGGATT       60     - CAGATTGATA TGAGCGTTGA AGCTAAAAAG AAAGAGCTTT TTGAAAAATA TT - #TTCAAAAC      120     - ATTGATGAAG TAACAAACAA AATGCTCGAT CAAAAAAATC AAAACAAAAA TA - #ACGATTGA      180     - GTGGCGTTAA TGCGCTAGAA TAGTGCTAAA AATAAGAATA AAGGAGTCAA AA - #GTATGAAA      240     - ACGGATTTTT ATAAAATTAA ATTACTATTT GCTTGGTGTC TTATCATTGG CA - #TGTTTACA      300     - GTCCGTTAAC GCTGACCAAA ACACTGATAT AAAAGATATT AGTCCTGAAG AT - #ATGGCACT      360     - AAATAGCGTG GGGCTTGTTT CTAGAGATCA ACTAAAAATA GAGATCCCTA AA - #GAAACCCT      420     - AGAGCAAAAA GTGGCCATAC TCAATGACTA TAATGATAAA AATGTTAATA TC - #AAATTTGA      480     - CGACATAAGT TTAGGGAGTT TTCAACCTAA TGATAATCTA GGTATCAATG CG - #ATGTGGGG      540     - CATTCAAAAT CTTCTCATGA GCCAAATGAT GGGCGATTAC GGTCCAAACA AT - #CCTTTCAT      600     - GTATGGTTAT GCACCAACAT ACTCAGATTC ATCGTTTTTA CCACCGATAT TA - #GGGTATTA      660     - ACTAGGGGGT GTTAACATGG CAGGTACACA AGCTATATAT GAATCATCTT CT - #GCAGGCTT      720     - CTTATCGGAA GTTTCCTCAA TCATCTCAAC ACAAGTGGTG TCGCAGGACC AT - #TTGCAGGA      780     - ATAGTAGCGG GTGCTATGAC AGCAGCGATT ATTCCTATTA TTGTTGGGTT TA - #CTAATCCG      840     - CAAATGACCG CTATCATGAC CCAATACAAT CAAAGCATCG CTGAAGCCGT AA - #GCATGCCT      900     - ATGAAAGCCG CTAACCAACA ATACAACCAA TTGTTTCAAG GTTTTAACGA TC - #AAAGCATG      960     - GCTGTGGGGA ACAATATCTT AAATATTAGC AAATTAACAG GGGAATTTAA CG - #CACAAGGC     1020     - AACACGCAAG GCGCGCAAAT TGGTGCTGTT AATAGTCAGA TTGCAAGCAT TT - #TAGCGAGT     1080     - AACACCACCC CTAAAAATCC CAGTGCTATT GAAGCTTATG CGACAAATCA AA - #TCGCTGTT     1140     - CCTAGCGTGC CAACAACGGT TGAAATGATG ACGGTATATT AGGCAATATT AC - #AAGTGCAG     1200     - CACCAAAATA CGCCCTAGCT CTACAAGAGC AACTGCGTTC TCAAGCAAGC AA - #CAGCTCAA     1260     - TGAATGATAC ACCGATTCCT TGATAGCTGT ACCGCTTTAG GTGCGCTTGT TG - #GCTCATCA     1320     - AAAGTGTTTT TCAGTTGCAT GCAAATTTCT ATGACGCCCC ATGAGCGTTT CT - #ATGCCCAC     1380     - TGTTTATGCT AAATACCAAG CGTTAGCCAC TAATGCCCTA ACTTCAGGCG TT - #AATCCTAT     1440     - GACCACTCCT GCATGCCCTA TTGGGGACAA GGTTCTTGCT GTTTATTGCT AT - #GCTGAAAA     1500     - AGTAGCAGAA ATTTTGAGAG AATACTATAT AGAATTTGTG AAAAATAATA CC - #AATTTGTT     1560     - GCAGAACGCT TCTCAAATGA TACTTAATCA ATCAGGATTA GCTACTAGCA CC - #TATGACAC     1620     - TCAAGCGATT TCTAACATAA GTTCGCTATA TAATTACAAT ATAGTAGCGA AT - #AAATCTTT     1680     - TTTGAAATCG CATTTGACTT ACCTTGATTA CATCAAAGAC AAGCTTAAGG GG - #CAAAAAGA     1740     - TAGCTACTTA ACAGAAAGGG TGCAAACTAA AATAATCGTG AAGTGAGGAA AG - #AGATGTGA     1800     - AATGTTTTTT AAGCATATTT TCTTTCTTGA CTTTTTGTGG TTTGTCTCTG AA - #TGGTACAG     1860     - AGGTAGTAAT AACGCTTGAA CCTGCCTTAA AAGCCATTCA GGCAGACGCA CA - #AGCCAAAC     1920     - AAAAAACCGC TCAAGCCGAA TTAAAAGCCA TAGAAGCTCA ATCTAGTGCC AA - #AGAAAAAG     1980     - CCATTCAAGC GCAAATAGAG GGAGAATTGA GGACTCAGCT TGCAACCATG AG - #CGCTATGT     2040     - TAAAAGGGGC TAATGGCGTT ATTAATGGTG TCAATGGCAT GACAGGGGGG TT - #TTTTGCAG     2100     - GTTCAGACAT CTTGCTTGGC GTCATGGAAG GGTATTCAAG CGGCGCTTAG TG - #CATTGGGG     2160     - GGGAATGTCA AAATGATCGT GGAAAAACAA AAAATTAATA CCCAAACAGA AA - #TCCAAAAC     2220     - ATGCAAATCG CGCTCCAAAA AAATAACGAA ATGATCAAGC TCAAAATGAA CC - #AGCAAAAC     2280     - GCTCTCTTAG AAGCGTTAAA AAACAGCTTT GAACCGAGGG TTACCCTAAA AA - #CACAAATG     2340     - GAAATTTCTC AAGCTCTAGG AAGTTCTTCT GACAACGCTC AATACATCGC TT - #ACAATACG     2400     - ATTGGTATCA AGGCGTTTGA AGAAACCTTA AAAGGTTTTG AGACATGGTT GA - #AAACAGCT     2460     - ATGCAAAAAG CGACCCTTAT TGATTATAAT TCCCTAACAG GTCAGGCTTT GT - #TTCAAAGT     2520     - GCCATCTATG CGCCTGCTCT TAGTTTTTTT TCAAGCATGG GCGACCATTT GG - #AATCATTG     2580     - AAACATTCAC TCTAGCGCCC ACAAAATGCC CTTATCTTGA TGGGCTAAAA AT - #TTCAGCAT     2640     - GCCTTATGGA ACAGGTTATT CAGAATTACA GAATGATTGT AGCCCTTATT CA - #AAATAAAC     2700     - TGAGTGATGC AGATTTTCAA AATATCGCTT ATTTGAATGG GATCAATGGA GA - #AATCAAAA     2760     - CCTTAAAAGG ATCAGTAGAT TTGAATGCGC TCATAGAAGT TGCGATCTTA AA - #CGCAGAAA     2820     - ATCATTTAAA CTATATAGAG AATCTTGAAA AAAAAGCCGA CCTTTGGGAA GA - #ACAACTGA     2880     - AATTAGAAAG AGAAACGACA GCAAGAAACA TTGCTAGCTC TAAAGTTATT GT - #CAAATGAA     2940     - AACACTCGTG AAAAATACCA TATCTTCTTT TTTGCTATTG TCTGTTTTGA TG - #GCAGAAGA     3000     - TATAACAAGC GGTTTAAAGC AACTTGATAG CACCTACCAA GAGACCAACC AA - #CAAGTGCT     3060     - TAAAAACTTA GATGAGATTT TTTCAACCAC TAGCCCTAGC GCTAATAATG AA - #ATAGGTCA     3120     - AGAAGATGCT CTAAACATCA AAAAAGCGGC CATTGCTTTG AGAGGAGATT TA - #GCGTTATT     3180     - GAAAGCCAAT TTTGAAGCGA ATGAGTTATT TTTCATCTCA GAAGATGTGA TT - #TTCAAGAC     3240     - TTATATGTCT AGCCCTGAAC TTTTATTAAC CTATATGAAA ATCAATCCCT TA - #GACCAAAA     3300     - TACTGCTGAG CAGCAATGCG GAATATCCGA TAAAGTTTTA GTTCTTTATT GT - #GAAGGGAA     3360     - GCTGAAAATC GAGCAAGAAA AACAAAATAT AAGAGAGCGT TTAGAAACTT CC - #CTAAAGGC     3420     - GTATCAGAGC AACATTGGAG GTACAGCTTC TTTAATCACT GCTTCACAGA CG - #CTTGTAGA     3480     - AAGCCTAAAG AATAAAAATT TCATCAAAGG AATCAGAAAG CTTATGTTAG CT - #CAAAACAA     3540     - GGTCTTTTTA AATTATTTAG AGGAGTTGGA CGCATTAGAA AGATCCCTAG AA - #CAAAGTAA     3600     - GCGACAATAC CTACAAGAAA GGCAATCAAG TAAGATCATT GTTAAATGAT TT - #TTCTGAGA     3660     - CGATAGGCTA TGATTTCTAA CCTTTGGGAT TTGCTTCTCT AGTTGTTTTT TG - #ATCTAATA     3720     - ATTTGGATCG CTATTTTTTT CCATGAGCGA TGCTTTGTTC TATTTGATTA AT - #GACGCTAT     3780     - TGGGCAGTTC GGATTTAGAG CGATTGTTTG AGAATGCTGT ATCTTTTGCT TC - #ATTTTCTT     3840     - TTGTATCAGT AGTGCCAAAC TCGCTTGCAT TTGTAGTATC CTGGTAGTTA TT - #TTCTTGCT     3900     - CCATTTCTTT TTTTCTCTTC TCACAAATAG CGCACGAAGC TATAAGAAAG GT - #TCTATTGT     3960     - TGCTGTTATT AAGAGTGAGT TTTAAACCAC TTAGGTGTTG TCTAGAAAAC TT - #GTTATTTT     4020     - CTGTCTCTAT TTCTTGAATT TTTTCTAATA AGGGGGCAAG AATTTTTTGA TC - #CTTGATGT     4080     - CTTTGATCAA ACACTTTCTT AACCCATCAT AATCGTTGCT TGAAATAATA CA - #CTCATTGA     4140     - TAATGATTTC AATTTCAGGC GTTCTCCACT CATCAGGCAC AAGCGGAGCA GC - #AGGTTTCA     4200     - CAAAAGATTC ATTCACCTTA TTGATCATCT CATCAAAGTT ATCATCTTTA TT - #AGATTTGG     4260     - TTTCAGTGGA AGAATTGCCA TTAGGGTTGT CATTCAAAGG GTTAGCGTTA AT - #TTCGTTTA     4320     - ATTCTTGTAA TCCTTTGTGC TTAGCCGCTT TTTTTTGAAC AGAAGAGTTG TT - #CTTTTTTT     4380     - CAAACGCTTC AATATTCTTT TGAATTTGCT CTGCTCTCGC CCTATCATTT CG - #TAATTTTT     4440     - CTCTCAAAGC GTTAGTGGCG AATTCTTTTT CCTCTTTTGT AATTTCTTTG TC - #ATCATGTC     4500     - TTTGATAGAG TGTTTTGATT TTTTTAAAGC CCAAAGCCAC TAATTCTTTG TA - #TTCACTCG     4560     - GCAATAATGC TGTATTGATA GCGTTTAAGC CCCCATAAAC AACTAAAGAA TA - #AAGAGCGA     4620     - CAGAACCTAG TTTTTTGAAG ATGCTTTTCA AGAGTGTCTC CTTTCCTATT CA - #AAGGGATT     4680     - ATTCTTGGCA ACATTGTTTC GGTATGCAGA AACCACTGTT TTGAAGAAAT TA - #AATATCTT     4740     - GTAGTGTTCT TTTTGCATAG TTGTTTTGTC GCTCATCATA GGATCGCTAT CT - #ACGAAAAT     4800     - ATTGTATTCA TTAGTCTTTT TTAATATAGA TTTTTTGGAT TGGGTATCAA AA - #TTGAAGCT     4860     - GTCTAAAACC AACAAACACA ATTTTTGTTT TTGTGCGAAA CTCAATTTTT GA - #TTGGTAAA     4920     - GTTTTTAGTC AGAGTATTCA ATTCCACATC ATCTTTGACA TTTTTCAACA AG - #TTAGAAAA     4980     - AGCCTCATAG CCCTTTAAGT ATTTCTCATT AATCTTATCA ATCGCCATGC TA - #AACGCCAA     5040     - GCGTTCCGAG TTAAAAATAA GCGTGTTAGT AGCCTTTTGA GCAATATCTT TT - #GCTTTGGA     5100     - GTCAAACTTG TTTCTTTCAA GCTGTAAGTA ATAATCTATA TTATCCAATT CT - #TTTCTGCT     5160     - CAACAGATTA GAAATGTTAT TGGAATATTC TTCAATCGCT TGCTTGTAAG AT - #AAAGCTAA     5220     - TCGGCTCGCT TTTTTGTTTT GTTCTTCATC TAAAGAAAAC ACACCCATTT GA - #AGCAAAGC     5280     - CTGAGCCTTA TCTTTAGTGG TAGCGGTAGT TTTTTTAAAA ATACTGCTCG TT - #TTCACCTG     5340     - GGGGTAGTAA TAACTCAAAC GCTCCTTATC TTCTTTGGTG AATGTTTCTT GG - #TTTAAAAC     5400     - ATAGGCATAA GGGTTAGGAA GAAGTTTCTT ATAGAGAGGG TTATCTTTTA AG - #GCTTGCTC     5460     - TAGATCCCCA TAAGCCTTTG ACTTAGCAGT AAGGATTTTA CCTCTTGTCT CT - #TTCTCTAG     5520     - AGCCTCTTTT TCTTTTTTGT ATTTGTTTTC TAAGTCTGAA AGAACTTTAG CT - #CTATGATC     5580     #                 559 - #9     - (2) INFORMATION FOR SEQ ID NO:14:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 1732 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:14:(xi) SEQUENCE DESCRIPTION: SEQ     -      Lys Asn His Ala Tyr Asp Phe Met - # His Leu Ile Met Asp Ser Ile     Ile     #   15     -      Pro Asn Trp Ile Gln Ile Asp Met - # Ser Val Glu Ala Lys Lys Lys     Glu     #                 30     -      Leu Phe Glu Lys Tyr Phe Gln Asn - # Ile Asp Glu Val Thr Asn Lys     Met     #             45     -      Leu Asp Gln Lys Asn Gln Asn Lys - # Asn Asn Asp Val Ala Leu Met     Arg     #         60     -      Asn Ser Ala Lys Asn Lys Asn Lys - # Gly Val Lys Ser Met Lys Thr     Asp     #     80     -      Phe Tyr Lys Ile Lys Leu Leu Phe - # Ala Trp Cys Leu Ile Ile Gly     Met     #   95     -      Phe Thr Val Arg Arg Pro Lys His - # Tyr Lys Arg Tyr Ser Arg Tyr     Gly     #                110     -      Thr Lys Arg Gly Ala Cys Phe Arg - # Ser Thr Lys Asn Arg Asp Pro     Arg     #            125     -      Asn Pro Arg Ala Lys Ser Gly His - # Thr Gln Leu Lys Cys Tyr Gln     Ile     #        140     -      Arg His Lys Phe Arg Glu Phe Ser - # Thr Ser Arg Tyr Gln Cys Asp     Val     #    160     -      Gly His Ser Lys Ser Ser His Glu - # Pro Asn Asp Gly Arg Leu Arg     Ser     #   175     -      Lys Gln Ser Phe His Val Trp Leu - # Cys Thr Asn Ile Leu Arg Phe     Ile     #                190     -      Val Phe Thr Thr Asp Ile Arg Val - # Leu Thr Arg Gly Cys His Gly     Arg     #            205     -      Tyr Thr Ser Tyr Ile Ile Ile Phe - # Cys Arg Leu Leu Ile Gly Ser     Phe     #        220     -      Leu Asn His Leu Asn Thr Ser Gly - # Val Ala Gly Pro Phe Ala Gly     Ile     #    240     -      Val Ala Gly Ala Met Thr Ala Ala - # Ile Ile Pro Ile Ile Val Gly     Phe     #   255     -      Thr Asn Pro Gln Met Thr Ala Ile - # Met Thr Gln Tyr Asn Gln Ser     Ile     #                270     -      Ala Glu Ala Val Ser Met Pro Met - # Lys Ala Ala Asn Gln Gln Tyr     Asn     #            285     -      Gln Leu Phe Gln Gly Phe Asn Asp - # Gln Ser Met Ala Val Gly Asn     Asn     #        300     -      Ile Leu Asn Ile Ser Lys Leu Thr - # Gly Glu Phe Asn Ala Gln Gly     Asn     #    320     -      Thr Gln Gly Ala Gln Ile Gly Ala - # Val Asn Ser Gln Ile Ala Ser     Ile     #   335     -      Leu Ala Ser Asn Thr Thr Pro Lys - # Asn Pro Ser Ala Ile Glu Ala     Tyr     #                350     -      Ala Thr Asn Gln Ile Ala Val Pro - # Ser Val Pro Thr Thr Val Glu     Met     #            365     -      Met Thr Val Tyr Ala Ile Leu Gln - # Val Gln His Gln Asn Thr Pro     Leu     #        380     -      Tyr Lys Ser Asn Cys Val Leu Lys - # Gln Ala Thr Ala Gln Met Ile     His     #    400     -      Arg Phe Leu Asp Ser Cys Thr Ala - # Leu Gly Ala Leu Val Gly Ser     Ser     #   415     -      Lys Val Phe Phe Ser Cys Met Gln - # Ile Ser Met Thr Pro His Glu     Arg     #                430     -      Phe Tyr Ala His Cys Leu Cys Ile - # Pro Ser Val Ser His Cys Pro     Asn     #            445     -      Phe Arg Arg Ser Tyr Asp His Ser - # Cys Met Pro Tyr Trp Gly Gln     Gly     #        460     -      Ser Cys Cys Leu Leu Leu Cys Lys - # Ser Ser Arg Asn Phe Glu Arg     Ile     #    480     -      Leu Tyr Arg Ile Cys Glu Lys Tyr - # Gln Phe Val Ala Glu Arg Phe     Ser     #   495     -      Asn Asp Thr Ser Ile Arg Ile Ser - # Tyr His Leu His Ser Ser Asp     Phe     #                510     -      His Lys Phe Ala Ile Leu Gln Tyr - # Ser Ser Glu Ile Phe Phe Glu     Ile     #            525     -      Ala Phe Asp Leu Pro Leu His Gln - # Arg Gln Ala Gly Ala Lys Arg     Leu     #        540     -      Leu Asn Arg Lys Gly Ala Asn Asn - # Asn Arg Glu Val Arg Lys Glu     Met     #    560     -      Asn Val Phe Ala Tyr Phe Leu Ser - # Leu Phe Val Val Cys Leu Met     Val     #   575     -      Gln Arg Arg Leu Asn Leu Pro Lys - # Pro Phe Arg Gln Thr His Lys     Pro     #                590     -      Asn Lys Lys Pro Leu Lys Pro Asn - # Lys Pro Lys Leu Asn Leu Val     Pro     #            605     -      Lys Lys Lys Pro Phe Lys Arg Lys - # Arg Glu Asn Gly Leu Ser Leu     Gln     #        620     -      Pro Ala Leu Cys Lys Gly Leu Met - # Ala Leu Leu Met Val Ser Met     Ala     #    640     -      Gln Gly Gly Phe Leu Gln Val Gln - # Thr Ser Cys Leu Ala Ser Trp     Lys     #   655     -      Gly Ile Gln Ala Ala Leu Ser Ala - # Leu Gly Gly Asn Val Lys Met     Ile     #                670     -      Val Glu Lys Gln Lys Ile Asn Thr - # Gln Thr Glu Ile Gln Asn Met     Gln     #            685     -      Ile Ala Leu Gln Lys Asn Asn Glu - # Met Ile Lys Leu Lys Met Asn     Gln     #        700     -      Gln Asn Ala Leu Leu Glu Ala Leu - # Lys Asn Ser Phe Glu Pro Arg     Val     #    720     -      Thr Leu Lys Thr Gln Met Glu Ile - # Ser Gln Ala Leu Gly Ser Ser     Ser     #   735     -      Asp Asn Ala Gln Tyr Ile Ala Tyr - # Asn Thr Ile Gly Ile Lys Ala     Phe     #                750     -      Glu Glu Thr Leu Lys Gly Phe Glu - # Thr Trp Leu Lys Thr Ala Met     Gln     #            765     -      Lys Ala Thr Leu Ile Asp Tyr Asn - # Ser Leu Thr Gly Gln Ala Leu     Phe     #        780     -      Gln Ser Ala Ile Tyr Ala Pro Ala - # Leu Ser Phe Phe Ser Ser Met     Gly     #    800     -      Asp His Leu Glu Ser Leu Lys His - # Ser Leu Arg Pro Gln Asn Ala     Leu     #   815     -      Ile Leu Met Gly Lys Phe Gln His - # Ala Leu Trp Asn Arg Leu Phe     Arg     #                830     -      Ile Thr Glu Leu Pro Leu Phe Lys - # Ile Asn Val Met Gln Ile Phe     Lys     #            845     -      Ile Ser Leu Ile Met Gly Ser Met - # Glu Lys Ser Lys Pro Lys Asp     Gln     #        860     -      Ile Met Arg Ser Lys Leu Arg Ser - # Thr Gln Lys Ile Ile Thr Ile     Arg     #    880     -      Ile Leu Lys Lys Lys Pro Thr Phe - # Gly Lys Asn Asn Asn Lys Glu     Lys     #   895     -      Arg Gln Gln Glu Thr Leu Leu Ala - # Leu Lys Leu Leu Ser Asn Glu     Asn     #                910     -      Thr Arg Glu Lys Tyr His Ile Phe - # Phe Phe Ala Ile Val Cys Phe     Asp     #            925     -      Gly Arg Arg Tyr Asn Lys Arg Phe - # Lys Ala Thr His Leu Pro Arg     Asp     #        940     -      Gln Pro Thr Ser Ala Lys Leu Arg - # Asp Phe Phe Asn His Pro Arg     Asn     #    960     -      Arg Ser Arg Arg Cys Ser Lys His - # Gln Lys Ser Gly His Cys Phe     Glu     #   975     -      Arg Arg Phe Ser Val Ile Glu Ser - # Gln Phe Ser Glu Val Ile Phe     His     #                990     -      Leu Arg Arg Cys Asp Phe Gln Asp - # Leu Tyr Val Pro Thr Phe Ile     Asn     #           10050     -      Leu Tyr Glu Asn Gln Ser Leu Arg - # Pro Lys Tyr Cys Ala Ala Met     Arg     #       10205     -      Asn Ile Arg Ser Phe Ser Ser Leu - # Leu Arg Glu Ala Glu Asn Arg     Ala     #  10405     -      Arg Lys Thr Lys Tyr Lys Arg Ala - # Phe Arg Asn Phe Pro Lys Gly     Val     # 10550     -      Ser Glu Gln His Trp Arg Tyr Ser - # Phe Phe Asn His Cys Phe Thr     Asp     #               10700 - #                1065     -      Ala Cys Arg Lys Pro Lys Glu Lys - # Phe His Gln Arg Asn Gln Lys     Ala     #           10850     -      Tyr Val Ser Ser Lys Gln Gly Leu - # Phe Lys Leu Phe Arg Gly Val     Gly     #       11005     -      Arg Ile Arg Lys Ile Pro Arg Thr - # Lys Ala Thr Ile Pro Thr Arg     Lys     #  11205     -      Ala Ile Lys Asp His Cys Met Ile - # Phe Leu Arg Arg Ala Met Ile     Ser     # 11350     -      Asn Leu Trp Asp Leu Leu Leu Leu - # Phe Phe Asp Leu Ile Ile Trp     Ile     #               11500 - #                1145     -      Ala Ile Phe Phe His Glu Arg Cys - # Phe Val Leu Phe Asp Arg Tyr     Trp     #           11650     -      Ala Val Arg Ile Ser Asp Cys Leu - # Arg Met Leu Tyr Leu Leu Leu     His     #       11805     -      Phe Leu Leu Tyr Gln Cys Gln Thr - # Arg Leu His Leu Tyr Pro Gly     Ser     #  12005     -      Tyr Phe Leu Ala Pro Phe Leu Phe - # Phe Ser Ser His Lys Arg Thr     Lys     # 12150     -      Leu Glu Arg Phe Tyr Cys Cys Cys - # Tyr Glu Val Leu Asn His Leu     Gly     #               12300 - #                1225     -      Val Val Lys Thr Cys Tyr Phe Leu - # Ser Leu Phe Leu Glu Phe Phe     Leu     #           12450     -      Ile Arg Gly Gln Glu Phe Phe Asp - # Pro Cys Leu Ser Asn Thr Phe     Leu     #       12605     -      Thr His His Asn Arg Cys Leu Lys - # Tyr Thr His Phe Gln Phe Gln     Ala     #  12805     -      Phe Ser Thr His Gln Ala Gln Ala - # Glu Gln Gln Val Ser Gln Lys     Ile     # 12950     -      His Ser Pro Tyr Ser Ser His Gln - # Ser Tyr His Leu Tyr Ile Trp     Phe     #               13100 - #                1305     -      Gln Trp Lys Asn Cys His Gly Cys - # His Ser Lys Gly Arg Phe Arg     Leu     #           13250     -      Ile Leu Val Ile Leu Cys Ala Pro - # Leu Phe Phe Glu Gln Lys Ser     Cys     #       13405     -      Ser Phe Phe Gln Thr Leu Gln Tyr - # Ser Phe Glu Phe Ala Leu Leu     Ser     #  13605     -      Pro Tyr His Phe Val Ile Phe Leu - # Ser Lys Arg Trp Arg Ile Leu     Phe     # 13750     -      Pro Leu Leu Phe Leu Cys His His - # Val Phe Asp Arg Val Phe Phe     Phe     #               13900 - #                1385     -      Ser Pro Lys Pro Leu Ile Leu Cys - # Ile His Ser Ala Ile Met Leu     Tyr     #           14050     -      Arg Leu Ser Pro His Lys Gln Leu - # Lys Asn Lys Glu Arg Gln Asn     Leu     #       14205     -      Val Phe Arg Cys Phe Ser Arg Val - # Ser Pro Phe Leu Phe Lys Gly     Ile     #  14405     -      Ile Leu Gly Asn Ile Val Ser Val - # Cys Arg Asn His Cys Phe Glu     Glu     # 14550     -      Ile Lys Tyr Leu Val Val Phe Phe - # Leu His Ser Cys Phe Val Ala     His     #               14700 - #                1465     -      His Arg Ile Ala Ile Tyr Glu Asn - # Ile Val Phe Ile Ser Leu Phe     Tyr     #           14850     -      Arg Phe Phe Gly Leu Gly Ile Lys - # Ile Glu Ala Val Asn Gln Gln     Thr     #       15005     -      Gln Phe Leu Phe Leu Cys Glu Thr - # Gln Phe Leu Ile Gly Lys Val     Phe     #  15205     -      Ser Gln Ser Ile Gln Phe His Ile - # Ile Phe Asp Ile Phe Gln Gln     Val     # 15350     -      Arg Lys Ser Leu Ile Ala Leu Val - # Phe Leu Ile Asn Leu Ile Asn     Arg     #               15500 - #                1545     -      His Ala Lys Arg Gln Ala Phe Arg - # Val Lys Asn Lys Arg Val Ser     Ser     #           15650     -      Leu Leu Ser Asn Ile Phe Cys Phe - # Gly Val Lys Leu Val Ser Phe     Lys     #       15805     -      Leu Val Ile Ile Tyr Ile Ile Gln - # Phe Phe Ser Ala Gln Gln Ile     Arg     #  16005     -      Asn Val Ile Gly Ile Phe Phe Asn - # Arg Leu Leu Val Arg Ser Ser     Ala     # 16150     -      Arg Phe Phe Val Leu Phe Phe Ile - # Arg Lys His Thr His Leu Lys     Gln     #               16300 - #                1625     -      Ser Leu Ser Leu Ile Phe Ser Gly - # Ser Gly Ser Phe Phe Lys Asn     Thr     #           16450     -      Ala Arg Phe His Leu Gly Val Val - # Ile Thr Gln Thr Leu Leu Ile     Phe     #       16605     -      Phe Gly Glu Cys Phe Leu Val Asn - # Ile Gly Ile Arg Val Arg Lys     Lys     #  16805     -      Phe Leu Ile Glu Arg Val Ile Phe - # Gly Leu Leu Ile Pro Ile Ser     Leu     # 16950     -      Leu Ser Ser Lys Asp Phe Thr Ser - # Cys Leu Phe Leu Ser Leu Phe     Phe     #               17100 - #                1705     -      Phe Phe Val Phe Val Phe Val Lys - # Asn Phe Ser Ser Met Ile His     Lys     #           17250     -      Leu Phe Asp Phe              1730     - (2) INFORMATION FOR SEQ ID NO:15:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 1724 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:15:(xi) SEQUENCE DESCRIPTION: SEQ     -      Lys Ser Cys Leu Leu His Ala Phe - # Asn His Gly Leu His Tyr Ser     Leu     #   15     -      Asp Ser Asp Tyr Glu Arg Ser Lys - # Glu Arg Ala Phe Lys Ile Phe     Ser     #                 30     -      Lys His Ser Asn Lys Gln Asn Ala - # Arg Ser Lys Lys Ser Lys Gln     Lys     #             45     -      Arg Leu Ser Gly Val Asn Ala Leu - # Glu Cys Lys Glu Arg Ser Gln     Lys     #         60     -      Tyr Glu Asn Gly Phe Leu Asn Ile - # Thr Ile Cys Leu Val Ser Tyr     His     #     80     -      Trp His Val Tyr Ser Pro Leu Thr - # Leu Thr Lys Thr Leu Ile Lys     Ile     #   95     -      Leu Val Leu Lys Ile Trp His Ile - # Ala Trp Gly Leu Phe Leu Glu     Ile     #                110     -      Asn Lys Arg Ser Leu Lys Lys Pro - # Ser Lys Lys Trp Pro Tyr Ser     Met     #            125     -      Thr Ile Met Ile Lys Met Leu Ile - # Ser Asn Leu Thr Thr Val Gly     Val     #        140     -      Phe Asn Leu Met Ile Ile Val Ser - # Met Arg Cys Gly Ala Phe Lys     Ile     #    160     -      Phe Ser Ala Lys Trp Ala Ile Thr - # Val Gln Thr Ile Leu Ser Cys     Met     #   175     -      Val Met His Gln His Thr Gln Ile - # His Arg Phe Tyr His Arg Tyr     Gly     #                190     -      Ile Asn Gly Val Leu Thr Trp Gln - # Val His Lys Leu Tyr Met Asn     His     #            205     -      Leu Leu Gln Ala Ser Tyr Arg Lys - # Phe Pro Gln Ser Ser Gln His     Lys     #        220     -      Trp Cys Arg Arg Thr Ile Cys Arg - # Asn Ser Ser Gly Cys Tyr Asp     Ser     #    240     -      Ser Asp Tyr Ser Tyr Tyr Cys Trp - # Val Tyr Ser Ala Asn Asp Arg     Tyr     #   255     -      His Asp Pro Ile Gln Ser Lys His - # Arg Ser Arg Lys His Ala Tyr     Glu     #                270     -      Ser Arg Pro Thr Ile Gln Pro Ile - # Val Ser Arg Phe Arg Ser Lys     His     #            285     -      Gly Cys Gly Glu Gln Tyr Leu Lys - # Tyr Gln Ile Asn Arg Gly Ile     Arg     #        300     -      Thr Arg Gln His Ala Arg Arg Ala - # Asn Trp Cys Cys Ser Asp Cys     Lys     #    320     -      His Phe Ser Glu His His Pro Lys - # Ser Gln Cys Tyr Ser Leu Cys     Asp     #   335     -      Lys Ser Asn Arg Cys Ser Arg Ala - # Asn Asn Gly Asn Asp Asp Gly     Ile     #                350     -      Leu Gly Asn Ile Thr Ser Ala Ala - # Pro Lys Tyr Ala Leu Ala Leu     Gln     #            365     -      Glu Gln Leu Arg Ser Gln Ala Ser - # Asn Ser Ser Met Asn Asp Thr     Pro     #        380     -      Ile Pro Leu Tyr Arg Phe Arg Cys - # Ala Cys Trp Leu Ile Lys Ser     Val     #    400     -      Phe Gln Leu His Ala Asn Phe Tyr - # Asp Ala Pro Ala Phe Leu Cys     Pro     #   415     -      Leu Phe Met Leu Asn Thr Lys Arg - # Pro Leu Met Pro Leu Gln Ala     Leu     #                430     -      Ile Leu Pro Leu Leu His Ala Leu - # Leu Gly Thr Arg Phe Leu Leu     Phe     #            445     -      Ile Ala Met Leu Lys Lys Gln Lys - # Phe Glu Asn Thr Ile Asn Leu     Lys     #        460     -      Ile Ile Pro Ile Cys Cys Arg Thr - # Leu Leu Lys Tyr Leu Ile Asn     Gln     #    480     -      Asp Leu Leu Ala Pro Met Thr Leu - # Lys Arg Phe Leu Thr Val Arg     Tyr     #   495     -      Ile Ile Thr Ile Arg Ile Asn Leu - # Phe Asn Arg Ile Leu Thr Leu     Ile     #                510     -      Thr Ser Lys Thr Ser Leu Arg Gly - # Lys Lys Ile Ala Thr Gln Lys     Gly     #            525     -      Cys Lys Leu Lys Ser Ser Glu Glu - # Arg Asp Val Lys Cys Phe Leu     Ser     #        540     -      Ile Phe Ser Phe Leu Thr Phe Cys - # Gly Leu Ser Leu Asn Gly Thr     Glu     #    560     -      Val Val Ile Thr Leu Glu Pro Ala - # Leu Lys Ala Ile Gln Ala Asp     Ala     #   575     -      Gln Ala Lys Gln Lys Thr Ala Gln - # Ala Glu Leu Lys Ala Ile Glu     Ala     #                590     -      Gln Ser Ser Ala Lys Glu Lys Ala - # Ile Gln Ala Gln Ile Glu Gly     Glu     #            605     -      Leu Arg Thr Gln Leu Ala Thr Met - # Ser Ala Met Leu Lys Gly Ala     Asn     #        620     -      Gly Val Ile Asn Gly Val Asn Gly - # Met Thr Gly Gly Phe Phe Ala     Gly     #    640     -      Ser Asp Ile Leu Leu Gly Val Met - # Glu Gly Tyr Ser Ser Gly Ala     Cys     #   655     -      Ile Gly Gly Glu Cys Gln Asn Asp - # Arg Gly Lys Thr Lys Asn Tyr     Pro     #                670     -      Asn Arg Asn Pro Lys His Ala Asn - # Arg Ala Pro Lys Lys Arg Asn     Asp     #            685     -      Gln Ala Gln Asn Glu Pro Ala Lys - # Arg Ser Leu Arg Ser Val Lys     Lys     #        700     -      Gln Leu Thr Glu Gly Tyr Pro Lys - # Asn Thr Asn Gly Asn Phe Ser     Ser     #    720     -      Ser Arg Lys Phe Phe Gln Arg Ser - # Ile His Arg Leu Gln Tyr Asp     Trp     #   735     -      Tyr Gln Gly Val Arg Asn Leu Lys - # Arg Phe Asp Met Val Glu Asn     Ser     #                750     -      Tyr Ala Lys Ser Asp Pro Tyr Leu - # Phe Pro Asn Arg Ser Gly Phe     Val     #            765     -      Ser Lys Cys His Leu Cys Ala Cys - # Ser Phe Phe Phe Lys His Gly     Arg     #        780     -      Pro Phe Gly Ile Ile Glu Thr Phe - # Thr Leu Ala Pro Thr Lys Cys     Pro     #    800     -      Tyr Leu Asp Gly Leu Lys Ile Ser - # Ala Cys Leu Met Glu Gln Val     Ile     #   815     -      Gln Asn Tyr Arg Met Ile Val Ala - # Leu Ile Gln Asn Lys Leu Ser     Asp     #                830     -      Ala Asp Phe Gln Asn Ile Ala Tyr - # Leu Asn Gly Ile Asn Gly Glu     Ile     #            845     -      Lys Thr Leu Lys Gly Ser Val Asp - # Leu Asn Ala Leu Ile Glu Val     Ala     #        860     -      Ile Leu Asn Ala Glu Asn His Leu - # Asn Tyr Ile Glu Asn Leu Glu     Lys     #    880     -      Lys Ala Asp Leu Trp Glu Glu Gln - # Leu Lys Leu Glu Arg Glu Thr     Thr     #   895     -      Ala Arg Asn Ile Ala Ser Ser Lys - # Val Ile Val Lys Lys His Ser     Lys     #                910     -      Ile Pro Tyr Leu Leu Phe Cys Tyr - # Cys Leu Phe Trp Gln Lys Ile     Gln     #            925     -      Ala Val Ser Asn Leu Ile Ala Pro - # Thr Lys Arg Pro Thr Asn Lys     Cys     #        940     -      Leu Lys Thr Met Arg Phe Phe Gln - # Pro Leu Ala Leu Ala Leu Ile     Met     #    960     -      Lys Val Lys Lys Met Leu Thr Ser - # Lys Lys Arg Pro Leu Leu Glu     Glu     #   975     -      Ile Arg Tyr Lys Pro Ile Leu Lys - # Arg Met Ser Tyr Phe Ser Ser     Gln     #                990     -      Lys Met Phe Ser Arg Leu Ile Cys - # Leu Ala Leu Asn Phe Tyr Pro     Ile     #           10050     -      Lys Ser Ile Pro Thr Lys Ile Leu - # Leu Ser Ser Asn Ala Glu Tyr     Pro     #       10205     -      Ile Lys Phe Phe Phe Ile Val Lys - # Gly Ser Lys Ser Ser Lys Lys     Asn     #  10405     -      Lys Ile Glu Ser Val Lys Leu Pro - # Arg Arg Ile Arg Ala Thr Leu     Glu     # 10550     -      Val Gln Leu Leu Ser Leu Leu His - # Arg Arg Leu Lys Ala Arg Ile     Lys     #               10700 - #                1065     -      Ile Ser Ser Lys Glu Ser Glu Ser - # Leu Cys Leu Lys Thr Arg Ser     Phe     #           10850     -      Ile Ile Arg Ser Trp Thr His Lys - # Asp Pro Asn Lys Val Ser Asp     Asn     #       11005     -      Thr Tyr Lys Lys Gly Asn Gln Val - # Arg Ser Leu Leu Asn Asp Phe     Ser     #  11205     -      Glu Thr Ile Gly Tyr Asp Phe Pro - # Leu Gly Phe Ala Ser Leu Val     Val     # 11350     -      Phe Ser Asn Asn Leu Asp Arg Tyr - # Phe Phe Pro Ala Met Leu Cys     Ser     #               11500 - #                1145     -      Ile Leu Met Thr Leu Leu Gly Ser - # Ser Asp Leu Glu Arg Leu Phe     Glu     #           11650     -      Asn Ala Val Ser Phe Ala Ser Phe - # Ser Phe Val Ser Val Val Pro     Asn     #       11805     -      Ser Leu Ala Phe Val Val Ser Trp - # Leu Phe Ser Cys Ser Ile Ser     Phe     #  12005     -      Phe Leu Phe Ser Gln Ile Ala His - # Glu Ala Ile Arg Lys Val Leu     Leu     # 12150     -      Leu Leu Leu Leu Arg Val Ser Phe - # Lys Pro Leu Arg Cys Cys Leu     Glu     #               12300 - #                1225     -      Asn Leu Leu Phe Ser Val Ser Ile - # Ser Ile Phe Ser Asn Lys Gly     Ala     #           12450     -      Arg Ile Phe Ser Leu Met Ser Leu - # Ile Lys His Phe Leu Asn Pro     Ser     #       12605     -      Ser Leu Leu Glu Ile Ile His Ser - # Leu Ile Met Ile Ser Ile Ser     Gly     #  12805     -      Val Leu His Ser Ser Gly Thr Ser - # Gly Ala Ala Gly Phe Thr Lys     Asp     # 12950     -      Ser Phe Thr Leu Leu Ile Ile Ser - # Ser Lys Leu Ser Ser Leu Leu     Asp     #               13100 - #                1305     -      Leu Val Ser Val Glu Glu Leu Pro - # Leu Gly Leu Ser Phe Lys Gly     Leu     #           13250     -      Ala Leu Ile Ser Phe Asn Ser Cys - # Asn Pro Leu Cys Leu Ala Ala     Phe     #       13405     -      Phe Thr Glu Glu Leu Phe Phe Phe - # Ser Asn Ala Ser Ile Phe Phe     Ile     #  13605     -      Cys Ser Ala Leu Ala Leu Ser Phe - # Arg Asn Phe Ser Leu Lys Ala     Leu     # 13750     -      Val Ala Asn Ser Phe Ser Ser Phe - # Val Ile Ser Leu Ser Ser Cys     Leu     #               13900 - #                1385     -      Ser Val Leu Ile Phe Leu Lys Pro - # Lys Ala Thr Asn Ser Leu Tyr     Ser     #           14050     -      Leu Gly Asn Asn Ala Val Leu Ile - # Ala Phe Lys Pro Pro Thr Thr     Lys     #       14205     -      Glu Arg Ala Thr Glu Pro Ser Phe - # Leu Lys Met Leu Phe Lys Ser     Val     #  14405     -      Ser Phe Pro Ile Gln Arg Asp Tyr - # Ser Trp Gln His Cys Phe Gly     Met     # 14550     -      Gln Lys Pro Leu Phe Arg Asn Ile - # Ser Cys Ser Val Leu Phe Ala     Leu     #               14700 - #                1465     -      Phe Cys Arg Ser Ser Asp Arg Tyr - # Leu Arg Lys Tyr Cys Ile His     Ser     #           14850     -      Phe Leu Ile Ile Phe Trp Ile Gly - # Tyr Gln Asn Ser Cys Leu Lys     Pro     #       15005     -      Thr Asn Thr Ile Phe Val Phe Val - # Arg Asn Ser Ile Phe Asp Trp     Ser     #  15205     -      Phe Ser Glu Tyr Ser Ile Pro His - # His Leu His Phe Ser Thr Ser     Lys     # 15350     -      Lys Pro His Ser Pro Leu Ser Ile - # Ser His Ser Tyr Gln Ser Pro     Cys     #               15500 - #                1545     -      Thr Pro Ser Val Pro Ser Lys Ala - # Cys Pro Phe Glu Gln Tyr Leu     Leu     #           15650     -      Leu Trp Ser Gln Thr Cys Phe Phe - # Gln Ala Val Ser Asn Asn Leu     Tyr     #       15805     -      Tyr Pro Ile Leu Phe Cys Ser Thr - # Asp Lys Cys Tyr Trp Asn Ile     Leu     #  16005     -      Gln Ser Leu Ala Cys Lys Ile Lys - # Leu Ile Gly Ser Leu Phe Cys     Phe     # 16150     -      Val Leu His Leu Lys Lys Thr His - # Pro Phe Glu Ala Lys Pro Glu     Pro     #               16300 - #                1625     -      Tyr Leu Trp Arg Phe Phe Lys Tyr - # Cys Ser Phe Ser Pro Gly Gly     Ser     #           16450     -      Asn Asn Ser Asn Ala Pro Tyr Leu - # Leu Trp Met Phe Leu Gly Leu     Lys     #       16605     -      His Arg His Lys Gly Glu Glu Val - # Ser Tyr Arg Glu Gly Tyr Leu     Leu     #  16805     -      Arg Leu Ala Leu Asp Pro His Lys - # Pro Leu Thr Gln Gly Phe Tyr     Leu     # 16950     -      Leu Ser Leu Ser Leu Glu Pro Leu - # Phe Leu Phe Cys Ile Cys Phe     Leu     #               17100 - #                1705     -      Ser Leu Lys Glu Leu Leu Tyr Asp - # Pro Ala Leu Phe     #            1720     - (2) INFORMATION FOR SEQ ID NO:16:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 1786 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:16:(xi) SEQUENCE DESCRIPTION: SEQ     -      Lys Ile Met Pro Met Thr Ser Cys - # Ile Ser Trp Thr Pro Leu Phe     Leu     #   15     -      Ile Gly Phe Arg Leu Ile Ala Leu - # Lys Leu Lys Arg Lys Ser Phe     Leu     #                 30     -      Lys Asn Ile Phe Lys Thr Leu Met - # Lys Gln Thr Lys Cys Ser Ile     Lys     #             45     -      Lys Ile Lys Thr Lys Ile Thr Ile - # Glu Trp Arg Cys Ala Arg Ile     Val     #         60     -      Leu Lys Ile Arg Ile Lys Glu Ser - # Lys Val Lys Arg Ile Phe Ile     Lys     #     80     -      Leu Asn Tyr Tyr Leu Leu Gly Val - # Leu Ser Leu Ala Cys Leu Gln     Ser     #   95     -      Val Asn Ala Asp Gln Asn Thr Asp - # Ile Lys Asp Ile Ser Pro Glu     Asp     #                110     -      Met Ala Leu Asn Ser Val Gly Leu - # Val Ser Arg Asp Gln Leu Lys     Ile     #            125     -      Glu Ile Pro Lys Glu Thr Leu Glu - # Gln Lys Val Ala Ile Leu Asn     Asp     #        140     -      Tyr Asn Asp Lys Asn Val Asn Ile - # Lys Phe Asp Asp Ile Ser Leu     Gly     #    160     -      Ser Phe Gln Pro Asn Asp Asn Leu - # Gly Ile Asn Ala Met Trp Gly     Ile     #   175     -      Gln Asn Leu Leu Met Ser Gln Met - # Met Gly Asp Tyr Gly Pro Asn     Asn     #                190     -      Pro Phe Met Tyr Gly Tyr Ala Pro - # Thr Tyr Ser Asp Ser Ser Phe     Leu     #            205     -      Pro Pro Ile Leu Gly Tyr Leu Gly - # Gly Val Asn Met Ala Gly Thr     Gln     #        220     -      Ala Ile Tyr Glu Ser Ser Ser Ala - # Gly Phe Leu Ser Glu Val Ser     Ser     #    240     -      Ile Ile Ser Thr Gln Val Val Ser - # Gln Asp His Leu Gln Glu Arg     Val     #   255     -      Leu Gln Gln Arg Leu Phe Leu Leu - # Leu Leu Gly Leu Leu Ile Arg     Lys     #                270     -      Pro Leu Ser Pro Asn Thr Ile Lys - # Ala Ser Leu Lys Pro Ala Cys     Leu     #            285     -      Lys Pro Leu Thr Asn Asn Thr Thr - # Asn Cys Phe Lys Val Leu Thr     Ile     #        300     -      Lys Ala Trp Leu Trp Gly Thr Ile - # Ser Ile Leu Ala Asn Gln Gly     Asn     #    320     -      Leu Thr His Lys Ala Thr Arg Lys - # Ala Arg Lys Leu Val Leu Leu     Ile     #   335     -      Val Arg Leu Gln Ala Phe Arg Val - # Thr Pro Pro Leu Lys Ile Pro     Val     #                350     -      Leu Leu Lys Leu Met Arg Gln Ile - # Lys Ser Leu Phe Leu Ala Cys     Gln     #            365     -      Gln Arg Leu Lys Arg Tyr Ile Arg - # Gln Tyr Tyr Lys Cys Ser Thr     Lys     #        380     -      Ile Arg Pro Ser Ser Thr Arg Ala - # Thr Ala Phe Ser Ser Lys Gln     Gln     #    400     -      Leu Asn Glu Tyr Thr Asp Ser Leu - # Ile Ala Val Pro Leu Val Arg     Leu     #   415     -      Leu Ala His Gln Lys Cys Phe Ser - # Val Ala Cys Lys Phe Leu Arg     Pro     #                430     -      Met Ser Val Ser Met Pro Thr Val - # Tyr Ala Lys Tyr Gln Ala Leu     Ala     #            445     -      Thr Asn Ala Leu Thr Ser Gly Val - # Asn Pro Met Thr Thr Pro Ala     Cys     #        460     -      Pro Ile Gly Asp Lys Val Leu Ala - # Val Tyr Cys Tyr Ala Glu Lys     Val     #    480     -      Ala Glu Ile Leu Arg Glu Tyr Tyr - # Ile Glu Phe Val Lys Asn Asn     Thr     #   495     -      Asn Leu Leu Gln Asn Ala Ser Gln - # Met Ile Leu Asn Gln Ser Gly     Leu     #                510     -      Ala Thr Ser Thr Tyr Asp Thr Gln - # Ala Ile Ser Asn Ile Ser Ser     Leu     #            525     -      Tyr Asn Tyr Asn Ile Val Ala Asn - # Lys Ser Phe Leu Lys Ser His     Leu     #        540     -      Thr Tyr Leu Asp Tyr Ile Lys Asp - # Lys Leu Lys Gly Gln Lys Asp     Ser     #    560     -      Tyr Leu Thr Glu Arg Val Gln Thr - # Lys Ile Ile Val Lys Gly Lys     Arg     #   575     -      Cys Glu Met Phe Phe Lys His Ile - # Phe Phe Leu Asp Phe Leu Trp     Phe     #                590     -      Val Ser Glu Trp Tyr Arg Gly Ser - # Asn Asn Ala Thr Cys Leu Lys     Ser     #            605     -      His Ser Gly Arg Arg Thr Ser Gln - # Thr Lys Asn Arg Ser Ser Arg     Ile     #        620     -      Lys Ser His Arg Ser Ser Ile Cys - # Gln Arg Lys Ser His Ser Ser     Ala     #    640     -      Asn Arg Gly Arg Ile Glu Asp Ser - # Ala Cys Asn His Glu Arg Tyr     Val     #   655     -      Lys Arg Gly Trp Arg Tyr Trp Cys - # Gln Trp His Asp Arg Gly Val     Phe     #                670     -      Cys Arg Phe Arg His Leu Ala Trp - # Arg His Gly Arg Val Phe Lys     Arg     #            685     -      Arg Leu Val His Trp Gly Gly Met - # Ser Lys Ser Trp Lys Asn Lys     Lys     #        700     -      Leu Ile Pro Lys Gln Lys Ser Lys - # Thr Cys Lys Ser Arg Ser Lys     Lys     #    720     -      Ile Thr Lys Ser Ser Ser Lys Thr - # Ser Lys Thr Leu Ser Lys Arg     Lys     #   735     -      Thr Ala Leu Asn Arg Gly Leu Pro - # Lys His Lys Trp Lys Phe Leu     Lys     #                750     -      Leu Glu Val Leu Leu Thr Thr Leu - # Asn Thr Ser Leu Thr Ile Arg     Leu     #            765     -      Val Ser Arg Arg Leu Lys Lys Pro - # Lys Val Leu Arg His Gly Lys     Gln     #        780     -      Leu Cys Lys Lys Arg Pro Leu Leu - # Ile Ile Ile Pro Gln Val Arg     Leu     #    800     -      Cys Phe Lys Val Pro Ser Met Arg - # Leu Leu Leu Val Phe Phe Gln     Ala     #   815     -      Trp Ala Thr Ile Trp Asn His Asn - # Ile His Ser Ser Ala His Lys     Met     #                830     -      Pro Leu Ser Trp Ala Lys Asn Phe - # Ser Met Pro Tyr Gly Thr Gly     Tyr     #            845     -      Ser Glu Leu Gln Asn Asp Cys Ser - # Pro Tyr Ser Lys Thr Glu Cys     Arg     #        860     -      Phe Ser Lys Tyr Arg Leu Phe Glu - # Trp Asp Gln Trp Arg Asn Gln     Asn     #    880     -      Leu Lys Arg Ile Ser Arg Phe Glu - # Cys Ala His Arg Ser Cys Asp     Leu     #   895     -      Lys Arg Arg Lys Ser Phe Lys Leu - # Tyr Arg Glu Ser Lys Lys Ser     Arg     #                910     -      Pro Leu Gly Arg Thr Thr Glu Ile - # Arg Lys Arg Asn Asp Ser Lys     Lys     #            925     -      His Cys Leu Ser Tyr Cys Gln Met - # Lys Thr Leu Val Lys Asn Thr     Ile     #        940     -      Ser Ser Phe Leu Leu Leu Ser Val - # Leu Met Ala Glu Asp Ile Thr     Ser     #    960     -      Gly Leu Lys Gln Leu Asp Ser Thr - # Tyr Gln Glu Thr Asn Gln Gln     Val     #   975     -      Leu Lys Asn Leu Asp Glu Ile Phe - # Ser Thr Thr Ser Pro Ser Ala     Asn     #                990     -      Asn Glu Ile Gly Gln Glu Asp Ala - # Leu Asn Ile Lys Lys Ala Ala     Ile     #           10050     -      Ala Leu Arg Gly Asp Leu Ala Leu - # Leu Lys Ala Asn Phe Glu Ala     Asn     #       10205     -      Glu Leu Phe Phe Ile Ser Glu Asp - # Val Ile Phe Lys Thr Tyr Met     Ser     #  10405     -      Ser Pro Glu Leu Leu Leu Thr Tyr - # Met Lys Ile Asn Pro Leu Asp     Gln     # 10550     -      Asn Thr Ala Glu Gln Gln Cys Gly - # Ile Ser Asp Lys Val Leu Val     Leu     #               10700 - #                1065     -      Tyr Cys Glu Gly Lys Leu Lys Ile - # Glu Gln Glu Lys Gln Asn Ile     Arg     #           10850     -      Glu Arg Leu Glu Thr Ser Leu Lys - # Ala Tyr Gln Ser Asn Ile Gly     Gly     #       11005     -      Thr Ala Ser Leu Ile Thr Ala Ser - # Gln Thr Leu Val Glu Ser Leu     Lys     #  11205     -      Asn Lys Asn Phe Ile Lys Gly Ile - # Arg Lys Leu Met Leu Ala Gln     Asn     # 11350     -      Lys Val Phe Leu Asn Tyr Leu Glu - # Glu Leu Asp Ala Leu Glu Arg     Ser     #               11500 - #                1145     -      Leu Glu Gln Ser Lys Arg Gln Tyr - # Leu Gln Glu Arg Gln Ser Ser     Lys     #           11650     -      Ile Ile Val Lys Phe Phe Asp Asp - # Arg Leu Phe Leu Thr Phe Gly     Ile     #       11805     -      Cys Phe Ser Ser Cys Phe Leu Ile - # Phe Gly Ser Leu Phe Phe Ser     Met     #  12005     -      Ser Asp Ala Leu Phe Tyr Leu Ile - # Asn Asp Ala Ile Gly Gln Phe     Gly     # 12150     -      Phe Arg Ala Ile Val Glu Cys Cys - # Ile Phe Cys Phe Ile Phe Phe     Cys     #               12300 - #                1225     -      Ile Ser Ser Ala Lys Leu Ala Cys - # Ile Cys Ser Ile Leu Val Val     Ile     #           12450     -      Phe Leu Leu His Phe Phe Phe Ser - # Leu Leu Thr Asn Ser Ala Arg     Ser     #       12605     -      Tyr Lys Lys Gly Ser Ile Val Ala - # Val Ile Lys Ser Glu Phe Thr     Thr     #  12805     -      Val Leu Ser Arg Lys Leu Val Ile - # Phe Cys Leu Tyr Phe Leu Asn     Phe     # 12950     -      Phe Gly Gly Lys Asn Phe Leu Ile - # Leu Asp Val Phe Asp Gln Thr     Leu     #               13100 - #                1305     -      Ser Pro Ile Ile Ile Val Ala Asn - # Asn Thr Leu Ile Asp Asn Asp     Phe     #           13250     -      Asn Phe Arg Arg Ser Pro Leu Ile - # Arg His Lys Arg Ser Ser Arg     Phe     #       13405     -      His Lys Arg Phe Ile His Leu Ile - # Asp His Leu Ile Lys Val Ile     Ile     #  13605     -      Phe Ile Arg Phe Gly Phe Ser Gly - # Arg Ile Ala Ile Arg Val Val     Ile     # 13750     -      Gln Arg Val Ser Val Asn Phe Val - # Phe Leu Ser Phe Val Leu Ser     Arg     #               13900 - #                1385     -      Phe Phe Leu Asn Arg Arg Val Val - # Leu Phe Phe Lys Arg Phe Asn     Ile     #           14050     -      Leu Leu Asn Leu Leu Cys Ser Arg - # Pro Ile Ile Ser Phe Phe Ser     Gln     #       14205     -      Ser Val Ser Gly Glu Phe Phe Phe - # Leu Phe Cys Asn Phe Phe Val     Ile     #  14405     -      Met Ser Leu Ile Glu Cys Phe Asp - # Phe Phe Lys Ala Gln Ser His     Phe     # 14550     -      Phe Val Phe Thr Arg Gln Cys Cys - # Ile Asp Ser Val Ala Pro Ile     Asn     #               14700 - #                1465     -      Asn Arg Ile Lys Ser Asp Arg Thr - # Phe Phe Glu Asp Ala Phe Gln     Glu     #           14850     -      Cys Leu Leu Ser Tyr Ser Lys Gly - # Leu Phe Leu Ala Thr Leu Phe     Arg     #       15005     -      Tyr Ala Glu Thr Thr Val Leu Lys - # Lys Leu Asn Ile Leu Cys Ser     Phe     #  15205     -      Cys Ile Val Val Leu Ser Leu Ile - # Ile Gly Ser Leu Ser Thr Lys     Ile     # 15350     -      Leu Tyr Ser Leu Val Phe Phe Asn - # Ile Asp Phe Leu Asp Trp Val     Ser     #               15500 - #                1545     -      Lys Leu Lys Leu Ser Lys Thr Asn - # Lys His Asn Phe Cys Phe Cys     Ala     #           15650     -      Lys Leu Asn Phe Leu Val Lys Phe - # Leu Val Arg Val Phe Asn Ser     Thr     #       15805     -      Ser Ser Leu Thr Phe Phe Asn Lys - # Leu Glu Lys Ala Ser Pro Phe     Lys     #  16005     -      Tyr Phe Ser Leu Ile Leu Ser Ile - # Ala Met Leu Asn Ala Lys Arg     Ser     # 16150     -      Glu Leu Lys Ile Ser Val Leu Val - # Ala Phe Ala Ile Ser Phe Ala     Leu     #               16300 - #                1625     -      Glu Ser Asn Leu Phe Leu Ser Ser - # Cys Lys Ser Ile Leu Ser Asn     Ser     #           16450     -      Phe Leu Leu Asn Arg Leu Glu Met - # Leu Leu Glu Tyr Ser Ser Ile     Ala     #       16605     -      Cys Leu Asp Lys Ala Asn Arg Leu - # Ala Phe Leu Phe Cys Ser Ser     Ser     #  16805     -      Lys Glu Asn Thr Pro Ile Ser Lys - # Ala Ala Leu Ser Leu Val Val     Ala     # 16950     -      Val Val Phe Leu Lys Ile Leu Leu - # Val Phe Thr Trp Gly Leu Lys     Arg     #               17100 - #                1705     -      Ser Leu Ser Ser Leu Val Asn Val - # Ser Trp Phe Lys Thr Ala Gly     Leu     #           17250     -      Gly Arg Ser Phe Leu Arg Gly Leu - # Ser Phe Lys Ala Cys Ser Arg     Ser     #       17405     -      Pro Ala Phe Asp Leu Ala Val Arg - # Ile Leu Pro Leu Val Ser Phe     Ser     #  17605     -      Arg Ala Ser Phe Ser Phe Leu Tyr - # Leu Phe Ser Lys Ser Glu Arg     Thr     # 17750     -      Leu Ala Leu Ser Ile Ser Ser Leu - # Ile Phe                      1780 - #                1785     - (2) INFORMATION FOR SEQ ID NO:17:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1529 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     #ID NO:17:(xi) SEQUENCE DESCRIPTION: SEQ     - AGGCAAAAAA CATGAAAACA TCACTAAATT CACTTGTCCT CATTGCAATA TC - #ACACACCA       60     - CAGAGATTAC AATGCGAGCG TCAATATTAG AAACTACGCT TTAGGCATGC TA - #GATGACAG      120     - GCATAAAATA AAGATAGATA AAAGTAGGGT AGGGATTATC CGAACTGATT AC - #GCTCATTA      180     - CACTGATGAG CGCATCAAAG CTTGTGGAGC TTCCTCTAAT GGGGTTATTT CT - #AAATATGG      240     - CAACATATTG GATCTAGCTA GTTATGGAGC GATGAAGCAA GAAAAAGCCC AA - #TCGCTTTA      300     - GCGTTGGTAA TTCACAAAAA GCAAGATCGG TTGTAAAAAT GCGTTACAAC TA - #AAATAAAG      360     - GGTCAAGATA ACTCATTTTC AAAAAGGAGT CTTAAGTAAT AAAATCATAA TG - #TTCAGCTA      420     - GTAATCTATT GCCTCGTTGA TCAAACAAAG CTCTGCGTGA AAGATGAAAA AA - #TTTCACCT      480     - TTAGATAGTT AATACACCAC TACAGTCTTA CTTGAGAGAC ACTCATTTTA TT - #AGCGGTTT      540     - TGTCTGATTT GCTGCTACCA AAACCATTAC CAACCAAAGC AGATCCCATG TT - #TTTGATAC      600     - TATCGAATCC ATTCTTCAGC ACTTCTGCCA TAAAATTCTT GATATTGTCC AT - #AGGCAAGT      660     - TAAATTTTTT CCCTAATGCT TCATTAAGTC CCATCATTAA CATCAGAAAG AA - #CAAAAAAT      720     - TTAATATCAT AGAAAACAAA TCACTGGATA AACCTGTAAA AAGATTTGTT CC - #GCCACCCA      780     - ACAAAGAAGC TAAAATTTTT CCCATGATCA GTCCTTTTAT TTTTGGTTGT GT - #AAGTTCTT      840     - GCTTGTTCGG ATCTCTAATG CGTGTTTTAG TAGGAAGCAT TTCACAATAG CA - #TACCTAAA      900     - GCTACTAAGA AAATTCTTGA ATCTATTGGT AAGATTACTC ATGAAATCAA GC - #GATAAGTA      960     - GCCACCAATC GCAAACAAAT CAAATATTTT GCCACCAAAC AAGCCATATC CT - #TTTTGTTT     1020     - TTATCTCCTA ATTATAGCAA ATTTTTATCA ATATTAATTT GGAAAACCAC CA - #CCATATCA     1080     - AAAACAAATT ACTAACACAC TAGATGCAGA ATTATTTTTA AAAAACGCTC AC - #TTAAATTT     1140     - AAAATCATGG GGTTTTAGGA TTTGAATACC AAAAATAGAT TGGTTTTTTC AA - #ATAAGCTA     1200     - GCTTTGTGTA TGCGCTTAAA AAGATTTTAG TTTTTAGTCA GTAAGGTTTT AT - #GCTAATGT     1260     - TTGGAAATAA AGAAATTTCT CTAAATCAAG TCTTGAGAAA TTTTTGAACG AA - #TCATAAGA     1320     - ACCAATTTTG CCATTGAGTC ATAAGTATGA TTAGCTTCAT TGTGAATTTT GC - #GTGGCTTA     1380     - AGAGATAGTA TTTGCTTATT ATGCTGAGAG AAACGAGTAG TAAAAGATAA GT - #AGTGTAAT     1440     - AAAAAAAGCT AGGTTTTATT ATAAGAGCGA ATAAGAATAA TATTGGATAA AC - #TAAAATCA     1500     #          1529    AAAA AGCCCTATT     - (2) INFORMATION FOR SEQ ID NO:18:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 464 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:18:(xi) SEQUENCE DESCRIPTION: SEQ     -      Arg Gln Lys Thr Lys His His Ile - # His Leu Ser Ser Leu Gln Tyr     His     #   15     -      Thr Pro Gln Arg Leu Gln Cys Glu - # Arg Gln Tyr Lys Leu Arg Phe     Arg     #                 30     -      His Ala Arg Gln Ala Asn Lys Asp - # Arg Lys Gly Arg Asp Tyr Pro     Asn     #             45     -      Leu Arg Ser Leu His Ala His Gln - # Ser Leu Trp Ser Phe Leu Trp     Gly     #         60     -      Tyr Phe Ile Trp Gln His Ile Gly - # Ser Ser Leu Trp Ser Asp Glu     Ala     #     80     -      Arg Lys Ser Pro Ile Ala Leu Ala - # Leu Val Ile His Lys Lys Gln     Asp     #   95     -      Arg Leu Lys Cys Val Thr Thr Lys - # Ile Lys Gly Gln Asp Asn Ser     Phe     #                110     -      Ser Lys Arg Ser Leu Lys Asn His - # Asn Val Gln Leu Val Ile Tyr     Cys     #            125     -      Leu Val Asp Gln Thr Lys Leu Cys - # Val Lys Asp Glu Lys Ile Ser     Pro     #        140     -      Leu Asp Ser Tyr Thr Thr Thr Val - # Leu Leu Glu Arg His Ser Phe     Tyr     #    160     -      Arg Phe Cys Leu Ile Cys Cys Tyr - # Gln Asn His Tyr Gln Pro Lys     Gln     #   175     -      Ile Pro Cys Phe Tyr Tyr Arg Ile - # His Ser Ser Ala Leu Leu Pro     Asn     #                190     -      Ser Tyr Cys Pro Ala Ser Ile Phe - # Ser Leu Met Leu His Val Pro     Ser     #            205     -      Leu Thr Ser Glu Arg Thr Lys Asn - # Leu Ile Ser Lys Thr Asn His     Trp     #        220     -      Ile Asn Leu Lys Asp Leu Phe Arg - # His Pro Thr Lys Lys Leu Lys     Phe     #    240     -      Phe Pro Ser Val Leu Leu Phe Leu - # Val Val Val Leu Ala Cys Ser     Asp     #   255     -      Leu Cys Val Phe Glu Ala Phe His - # Asn Ser Ile Pro Lys Ala Thr     Lys     #                270     -      Lys Ile Leu Glu Ser Ile Gly Lys - # Ile Thr His Glu Ile Lys Arg     Val     #            285     -      Ala Thr Asn Arg Lys Gln Ile Lys - # Tyr Phe Ala Thr Lys Gln Ala     Ile     #        300     -      Ser Phe Leu Phe Leu Ser Pro Asn - # Tyr Ser Lys Phe Leu Ser Ile     Leu     #    320     -      Ile Trp Lys Thr Thr Thr Ile Ser - # Lys Thr Asn Tyr His Thr Arg     Cys     #   335     -      Arg Ile Ile Phe Lys Lys Arg Ser - # Leu Lys Phe Lys Ile Met Gly     Phe     #                350     -      Asp Leu Asn Thr Lys Asn Arg Leu - # Val Phe Ser Asn Lys Leu Ala     Leu     #            365     -      Cys Met Arg Leu Lys Arg Phe Phe - # Leu Val Ser Lys Val Leu Cys     Cys     #        380     -      Leu Glu Ile Lys Lys Phe Leu Ile - # Lys Ser Glu Ile Phe Glu Arg     Ile     #    400     -      Ile Arg Thr Asn Phe Ala Ile Glu - # Ser Val Leu Ala Ser Leu Ile     Leu     #   415     -      Arg Gly Leu Arg Asp Ser Ile Cys - # Leu Leu Cys Glu Lys Arg Val     Val     #                430     -      Lys Asp Lys Cys Asn Lys Lys Ser - # Val Leu Leu Glu Arg Ile Arg     Ile     #            445     -      Ile Leu Asp Lys Leu Lys Ser Pro - # Leu Pro His Lys Lys Lys Ala     Leu     #        460     - (2) INFORMATION FOR SEQ ID NO:19:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 481 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:19:(xi) SEQUENCE DESCRIPTION: SEQ     -      Gly Lys Lys His Glu Asn Ile Thr - # Lys Phe Thr Cys Pro His Cys     Asn     #   15     -      Ile Thr His His Arg Asp Tyr Asn - # Ala Ser Val Asn Ile Arg Asn     Tyr     #                 30     -      Ala Leu Gly Met Leu Asp Asp Arg - # His Lys Ile Lys Ile Asp Lys     Ser     #             45     -      Arg Val Gly Ile Ile Arg Thr Asp - # Tyr Ala His Tyr Thr Asp Glu     Arg     #         60     -      Ile Lys Ala Cys Gly Ala Ser Ser - # Asn Gly Val Ile Ser Lys Tyr     Gly     #     80     -      Asn Ile Leu Asp Leu Ala Ser Tyr - # Gly Ala Met Lys Gln Glu Lys     Ala     #   95     -      Gln Ser Leu Arg Trp Phe Thr Lys - # Ser Lys Ile Gly Cys Lys Asn     Ala     #                110     -      Leu Gln Leu Lys Arg Val Lys Ile - # Thr His Phe Gln Lys Gly Val     Leu     #            125     -      Ser Asn Lys Ile Ile Met Phe Ser - # Ser Ile Ala Ser Leu Ile Lys     Gln     #        140     -      Ser Ser Ala Lys Met Lys Lys Phe - # His Leu Ile Val Asn Thr Pro     Leu     #    160     -      Gln Ser Tyr Leu Arg Asp Thr His - # Phe Ile Ser Gly Phe Val Phe     Ala     #   175     -      Ala Thr Lys Thr Ile Thr Asn Gln - # Ser Arg Ser His Val Phe Asp     Thr     #                190     -      Ile Glu Ser Ile Leu Gln His Phe - # Cys His Lys Ile Leu Asp Ile     Val     #            205     -      His Arg Gln Val Lys Phe Phe Pro - # Cys Phe Ile Lys Ser His His     His     #        220     -      Gln Lys Glu Gln Lys Ile Tyr His - # Arg Lys Gln Ile Thr Gly Thr     Cys     #    240     -      Lys Lys Ile Cys Ser Ala Thr Gln - # Gln Arg Ser Asn Phe Ser His     Asp     #   255     -      Gln Ser Phe Tyr Phe Trp Leu Cys - # Lys Phe Leu Leu Val Arg Ile     Ser     #                270     -      Asn Ala Cys Phe Ser Arg Lys His - # Phe Thr Ile Ala Tyr Leu Lys     Leu     #            285     -      Leu Arg Lys Phe Leu Asn Leu Leu - # Val Arg Leu Leu Met Lys Ser     Ser     #        300     -      Asp Lys Pro Pro Ile Ala Asn Lys - # Ser Asn Ile Leu Pro Pro Asn     Lys     #    320     -      Pro Tyr Pro Phe Cys Phe Tyr Leu - # Leu Ile Ile Ala Asn Phe Tyr     Gln     #   335     -      Tyr Phe Gly Lys Pro Pro Pro Tyr - # Gln Lys Gln Ile Thr Asn Thr     Leu     #                350     -      Asp Ala Glu Leu Phe Leu Lys Asn - # Ala His Leu Asn Leu Lys Ser     Trp     #            365     -      Gly Phe Arg Ile Ile Pro Lys Ile - # Asp Trp Phe Phe Gln Ile Ser     Leu     #        380     -      Cys Val Cys Ala Lys Asp Phe Ser - # Phe Ser Val Arg Phe Tyr Ala     Asn     #    400     -      Val Trp Lys Arg Asn Phe Ser Lys - # Ser Ser Leu Glu Lys Phe Leu     Asn     #   415     -      Glu Ser Glu Pro Ile Leu Pro Leu - # Ser His Lys Tyr Asp Leu His     Cys     #                430     -      Glu Phe Cys Val Ala Glu Ile Val - # Phe Ala Tyr Tyr Ala Glu Arg     Asn     #            445     -      Glu Lys Ile Ser Ser Val Ile Lys - # Lys Ala Arg Phe Tyr Tyr Lys     Ser     #        460     -      Glu Glu Tyr Trp Ile Asn Asn His - # Pro Cys Pro Ile Arg Lys Lys     Pro     #    480     -      Tyr     - (2) INFORMATION FOR SEQ ID NO:20:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 471 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:20:(xi) SEQUENCE DESCRIPTION: SEQ     -      Ala Lys Asn Met Lys Thr Ser Leu - # Asn Ser Leu Val Leu Ile Ala     Ile     #   15     -      Ser His Thr Thr Glu Ile Thr Met - # Arg Ala Ser Ile Leu Glu Thr     Thr     #                 30     -      Leu Ala Cys Met Thr Gly Ile Lys - # Arg Ile Lys Val Gly Gly Leu     Ser     #             45     -      Glu Leu Ile Thr Leu Ile Thr Leu - # Met Ser Ala Ser Lys Leu Val     Glu     #         60     -      Leu Pro Leu Met Gly Leu Phe Leu - # Asn Met Ala Thr Tyr Trp Ile     Leu     #     80     -      Val Met Glu Arg Ser Lys Lys Lys - # Pro Asn Arg Phe Ser Val Gly     Asn     #   95     -      Ser Gln Lys Ala Arg Ser Val Val - # Lys Met Arg Tyr Asn Asn Lys     Gly     #                110     -      Ser Arg Leu Ile Phe Lys Lys Glu - # Ser Val Ile Lys Ser Cys Ser     Ala     #            125     -      Ser Asn Leu Leu Pro Arg Ser Asn - # Lys Ala Leu Arg Glu Arg Lys     Asn     #        140     -      Phe Thr Phe Arg Leu Ile His His - # Tyr Ser Leu Thr Glu Thr Leu     Ile     #    160     -      Leu Leu Ala Val Leu Ser Asp Leu - # Leu Leu Pro Lys Pro Leu Pro     Thr     #   175     -      Lys Ala Asp Pro Met Phe Leu Ile - # Leu Ser Asn Pro Phe Phe Ser     Thr     #                190     -      Ser Ala Ile Lys Phe Leu Ile Leu - # Ser Ile Gly Lys Leu Asn Phe     Phe     #            205     -      Pro Asn Ala Ser Leu Ser Pro Ile - # Ile Asn Ile Arg Lys Asn Lys     Lys     #        220     -      Phe Asn Ile Ile Glu Asn Lys Ser - # Leu Asp Lys Pro Val Lys Arg     Phe     #    240     -      Val Pro Pro Pro Asn Lys Glu Ala - # Lys Ile Phe Pro Met Ile Ser     Pro     #   255     -      Phe Ile Phe Gly Cys Val Ser Ser - # Cys Leu Phe Gly Ser Leu Met     Arg     #                270     -      Val Leu Val Gly Ser Ile Ser Gln - # His Thr Ser Tyr Glu Asn Ser     Ile     #            285     -      Tyr Trp Asp Tyr Ser Asn Gln Ala - # Ile Ser Ser His Gln Ser Gln     Thr     #        300     -      Asn Gln Ile Phe Cys His Gln Thr - # Ser His Ile Leu Phe Val Phe     Ile     #    320     -      Ser Leu Gln Ile Phe Ile Asn Ile - # Asn Leu Glu Asn His His His     Ile     #   335     -      Lys Asn Lys Leu Leu Thr His Met - # Gln Asn Tyr Phe Lys Thr Leu     Thr     #                350     -      Ile Asn His Gly Val Leu Gly Phe - # Glu Tyr Gln Lys Ile Gly Phe     Phe     #            365     -      Lys Ala Ser Phe Val Tyr Ala Leu - # Lys Lys Ile Leu Val Phe Ser     Gln     #        380     -      Gly Phe Met Leu Met Phe Gly Asn - # Lys Glu Ile Ser Leu Asn Gln     Val     #    400     -      Leu Arg Asn Phe Thr Asn His Lys - # Asn Gln Phe Cys His Val Ile     Ser     #   415     -      Met Ile Ser Phe Ile Val Asn Phe - # Ala Trp Leu Lys Arg Tyr Leu     Leu     #                430     -      Ile Met Leu Arg Glu Thr Ser Ser - # Lys Arg Val Val Lys Lys Leu     Gly     #            445     -      Phe Ile Ile Arg Ala Asn Lys Asn - # Asn Ile Gly Thr Lys Ile Thr     Pro     #        460     -      Ala Pro Glu Lys Ser Pro Ile     #    470     - (2) INFORMATION FOR SEQ ID NO:21:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1529 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     #ID NO:21:(xi) SEQUENCE DESCRIPTION: SEQ     - AATAGGGCTT TTTTCTTATG GGGCAGGGGT GATTTTAGTT TATCCAATAT TA - #TTCTTATT       60     - CGCTCTTATA ATAAAACCTA GCTTTTTTTA TTACACTACT TATCTTTTAC TA - #CTCGTTTC      120     - TCTCAGCATA ATAAGCAAAT ACTATCTCTT AAGCCACGCA AAATTCACAA TG - #AAGCTAAT      180     - CATACTTATG ACTCAATGGC AAAATTGGTT CTTATGATTC GTTCAAAAAT TT - #CTCAAGAC      240     - TTGATTTAGA GAAATTTCTT TATTTCCAAA CATTAGCATA AAACCTTACT GA - #CTAAAAAC      300     - TAAAATCTTT TTAAGCGCAT ACACAAAGCT AGCTTATTTG AAAAAACCAA TC - #TATTTTTG      360     - GTATTCAAAT CCTAAAACCC CATGATTTTA AATTTAAGTG AGCGTTTTTT AA - #AAATAATT      420     - CTGCATCTAG TGTGTTAGTA ATTTGTTTTT GATATGGTGG TGGTTTTCCA AA - #TTAATATT      480     - GATAAAAATT TGCTATAATT AGGAGATAAA AACAAAAAGG ATATGGCTTG TT - #TGGTGGCA      540     - AAATATTTGA TTTGTTTGCG ATTGGTGGCT ACTTATCGCT TGATTTCATG AG - #TAATCTTA      600     - CCAATAGATT CAAGAATTTT CTTAGTAGCT TTAGGTATGC TATTGTGAAA TG - #CTTCCTAC      660     - TAAAACACGC ATTAGAGATC CGAACAAGCA AGAACTTACA CAACCAAAAA TA - #AAAGGACT      720     - GATCATGGGA AAAATTTTAG CTTCTTTGTT GGGTGGCGGA ACAAATCTTT TT - #ACAGGTTT      780     - ATCCAGTGAT TTGTTTTCTA TGATATTAAA TTTTTTGTTC TTTCTGATGT TA - #ATGATGGG      840     - ACTTAATGAA GCATTAGGGA AAAAATTTAA CTTGCCTATG GACAATATCA AG - #AATTTTAT      900     - GGCAGAAGTG CTGAAGAATG GATTCGATAG TATCAAAAAC ATGGGATCTG CT - #TTGGTTGG      960     - TAATGGTTTT GGTAGCAGCA AATCAGACAA AACCGCTAAT AAAATGAGTG TC - #TCTCAAGT     1020     - AAGACTGTAG TGGTGTATTA ACTATCTAAA GGTGAAATTT TTTCATCTTT CA - #CGCAGAGC     1080     - TTTGTTTGAT CAACGAGGCA ATAGATTACT AGCTGAACAT TATGATTTTA TT - #ACTTAAGA     1140     - CTCCTTTTTG AAAATGAGTT ATCTTGACCC TTTATTTTAG TTGTAACGCA TT - #TTTACAAC     1200     - CGATCTTGCT TTTTGTGAAT TACCAACGCT AAAGCGATTG GGCTTTTTCT TG - #CTTCATCG     1260     - CTCCATAACT AGCTAGATCC AATATGTTGC CATATTTAGA AATAACCCCA TT - #AGAGGAAG     1320     - CTCCACAAGC TTTGATGCGC TCATCAGTGT AATGAGCGTA ATCAGTTCGG AT - #AATCCCTA     1380     - CCCTACTTTT ATCTATCTTT ATTTTATGCC TGTCATCTAG CATGCCTAAA GC - #GTAGTTTC     1440     - TAATATTGAC GCTCGCATTG TAATCTCTGT GGTGTGTGAT ATTGCAATGA GG - #ACAAGTGA     1500     #          1529    ATGT TTTTTGCCT     - (2) INFORMATION FOR SEQ ID NO:22:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 485 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:22:(xi) SEQUENCE DESCRIPTION: SEQ     -      Ile Gly Leu Phe Ser Tyr Gly Ala - # Gly Val Ile Leu Val Tyr Pro     Ile     #   15     -      Leu Phe Leu Phe Ala Leu Ile Ile - # Lys Pro Ser Phe Phe Tyr Tyr     Thr     #                 30     -      Thr Tyr Leu Leu Leu Leu Val Ser - # Leu Ser Ile Ile Ser Lys Tyr     Tyr     #             45     -      Leu Leu Ser His Ala Lys Phe Thr - # Met Lys Leu Ile Ile Leu Met     Thr     #         60     -      Gln Trp Gln Asn Trp Phe Leu Phe - # Val Gln Lys Phe Leu Lys Thr     Phe     #     80     -      Arg Glu Ile Ser Leu Phe Pro Asn - # Ile Ser Ile Lys Pro Tyr Leu     Lys     #   95     -      Thr Lys Ile Phe Leu Ser Ala Tyr - # Thr Lys Leu Ala Tyr Leu Lys     Lys     #                110     -      Pro Ile Tyr Phe Trp Tyr Ser Asn - # Pro Lys Thr Pro Phe Ile Val     Ser     #            125     -      Val Phe Lys Phe Cys Ile Cys Val - # Ser Asn Leu Phe Leu Ile Trp     Trp     #        140     -      Trp Phe Ser Lys Leu Ile Leu Ile - # Lys Ile Cys Tyr Asn Glu Ile     Lys     #    160     -      Thr Lys Arg Ile Trp Leu Val Trp - # Trp Gln Asn Ile Phe Val Cys     Asp     #   175     -      Trp Trp Leu Leu Ile Ala Phe His - # Glu Ser Tyr Gln Ile Gln Glu     Phe     #                190     -      Ser Leu Val Cys Tyr Cys Glu Met - # Leu Pro Thr Lys Thr Arg Ile     Arg     #            205     -      Asp Pro Asn Lys Gln Glu Leu Thr - # Gln Pro Lys Ile Lys Gly Leu     Ile     #        220     -      Met Gly Lys Ile Leu Ala Ser Leu - # Leu Gly Gly Gly Thr Asn Leu     Phe     #    240     -      Thr Gly Leu Ser Ser Asp Leu Phe - # Ser Met Ile Leu Asn Phe Leu     Phe     #   255     -      Phe Leu Met Leu Met Met Gly Leu - # Asn Glu Ala Leu Gly Lys Lys     Phe     #                270     -      Asn Leu Pro Met Asp Asn Ile Lys - # Asn Phe Met Ala Glu Val Leu     Lys     #            285     -      Asn Gly Phe Asp Ser Ile Lys Asn - # Met Gly Ser Ala Leu Val Gly     Asn     #        300     -      Gly Phe Gly Ser Ser Lys Ser Asp - # Lys Thr Ala Asn Lys Met Ser     Val     #    320     -      Ser Gln Val Arg Leu Trp Cys Ile - # Asn Tyr Leu Lys Val Lys Phe     Phe     #   335     -      His Leu Ser Arg Arg Ala Leu Phe - # Asp Gln Arg Gly Asn Arg Leu     Leu     #                350     -      Ala Glu His Tyr Asp Phe Ile Thr - # Asp Ser Phe Leu Lys Met Ser     Tyr     #            365     -      Leu Asp Pro Leu Phe Leu Arg Ile - # Phe Thr Thr Asp Leu Ala Phe     Cys     #        380     -      Glu Leu Pro Thr Leu Lys Arg Leu - # Gly Phe Phe Leu Leu His Arg     Ser     #    400     -      Ile Thr Ser Ile Gln Tyr Val Ala - # Ile Phe Arg Asn Asn Pro Ile     Arg     #   415     -      Gly Ser Ser Thr Ser Phe Asp Ala - # Leu Ile Ser Val Met Ser Val     Ile     #                430     -      Ser Ser Asp Asn Pro Tyr Pro Thr - # Phe Ile Tyr Leu Tyr Phe Met     Pro     #            445     -      Val Ile His Ala Ser Val Val Ser - # Asn Ile Asp Ala Arg Ile Val     Ile     #        460     -      Ser Val Val Cys Asp Ile Ala Met - # Arg Thr Ser Glu Phe Ser Asp     Val     #    480     -      Phe Met Phe Phe Ala     #   485     - (2) INFORMATION FOR SEQ ID NO:23:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 469 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:23:(xi) SEQUENCE DESCRIPTION: SEQ     -      Asn Arg Ala Phe Phe Leu Trp Gly - # Arg Gly Asp Phe Ser Leu Ser     Asn     #   15     -      Ile Ile Leu Ile Arg Ser Tyr Asn - # Lys Thr Leu Phe Leu Leu His     Tyr     #                 30     -      Leu Ser Phe Thr Thr Arg Phe Ser - # Gln His Asn Lys Gln Ile Leu     Ser     #             45     -      Leu Lys Pro Arg Lys Ile His Asn - # Glu Ala Asn His Thr Tyr Asp     Ser     #         60     -      Met Ala Lys Leu Val Leu Met Ile - # Arg Ser Lys Ile Ser Gln Asp     Leu     #     80     -      Ile Arg Asn Phe Phe Ile Ser Lys - # His His Lys Thr Leu Leu Thr     Lys     #   95     -      Asn Asn Leu Phe Lys Arg Ile His - # Lys Ala Ser Leu Phe Glu Lys     Thr     #                110     -      Asn Leu Phe Leu Val Phe Lys Ser - # Asn Pro Met Ile Leu Asn Leu     Ser     #            125     -      Glu Arg Phe Leu Lys Ile Ile Leu - # His Leu Val Cys Phe Val Phe     Asp     #        140     -      Met Val Val Val Phe Gln Ile Asn - # Ile Asp Lys Asn Leu Leu Leu     Gly     #    160     -      Asp Lys Asn Lys Lys Asp Met Ala - # Cys Leu Val Ala Lys Tyr Leu     Ile     #   175     -      Cys Leu Arg Leu Val Ala Thr Tyr - # Arg Leu Ile Ser Val Ile Leu     Pro     #                190     -      Ile Asp Ser Arg Ile Phe Leu Val - # Ala Leu Gly Met Leu Leu Asn     Ala     #            205     -      Ser Tyr Asn Thr His Arg Ser Glu - # Gln Ala Arg Thr Tyr Thr Thr     Lys     #        220     -      Asn Lys Arg Thr Asp His Gly Lys - # Asn Phe Ser Phe Phe Val Gly     Trp     #    240     -      Arg Asn Lys Ser Phe Tyr Arg Phe - # Ile Gln Phe Val Phe Tyr Asp     Ile     #   255     -      Lys Phe Phe Val Leu Ser Asp Val - # Asn Asp Gly Thr Ser Ile Arg     Glu     #                270     -      Lys Ile Leu Ala Tyr Gly Gln Tyr - # Gln Glu Phe Tyr Gly Arg Ser     Ala     #            285     -      Glu Glu Trp Ile Arg Tyr Gln Lys - # His Gly Ile Cys Phe Gly Trp     Trp     #        300     -      Phe Trp Gln Gln Ile Arg Gln Asn - # Arg Asn Glu Cys Leu Ser Ser     Lys     #    320     -      Thr Val Val Val Tyr Leu Ser Lys - # Gly Glu Ile Phe Ser Ser Phe     Thr     #   335     -      Gln Ser Phe Val Ser Thr Arg Gln - # Ile Thr Ser Thr Leu Phe Tyr     Tyr     #                350     -      Leu Arg Leu Leu Phe Glu Asn Glu - # Leu Ser Pro Phe Ile Leu Val     Val     #            365     -      Thr His Phe Tyr Asn Arg Ser Cys - # Phe Leu Ile Thr Asn Ala Lys     Ala     #        380     -      Ile Gly Leu Phe Leu Ala Ser Ser - # Leu His Asn Leu Asp Pro Ile     Cys     #    400     -      Cys His Ile Lys Pro His Arg Lys - # Leu His Lys Leu Cys Ala His     Gln     #   415     -      Cys Asn Glu Arg Asn Gln Phe Gly - # Ser Leu Pro Tyr Phe Tyr Leu     Ser     #                430     -      Leu Phe Tyr Ala Cys His Leu Ala - # Cys Leu Lys Arg Ser Phe Tyr     Arg     #            445     -      Ser His Cys Asn Leu Cys Gly Val - # Tyr Cys Asn Glu Asp Lys Ile     Cys     #        460     -      Phe His Val Phe Cys          465     - (2) INFORMATION FOR SEQ ID NO:24:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 462 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     #ID NO:24:(xi) SEQUENCE DESCRIPTION: SEQ     -      Gly Phe Phe Leu Met Gly Gln Gly - # Phe Phe Ile Gln Tyr Tyr Ser     Tyr     #   15     -      Ser Leu Leu Asn Leu Ala Phe Phe - # Ile Thr Leu Leu Ile Phe Tyr     Tyr     #                 30     -      Ser Phe Leu Ser Ala Ala Asn Thr - # Ile Ser Ala Thr Gln Asn Ser     Gln     #             45     -      Ser Ser Tyr Leu Leu Asn Gly Lys - # Ile Gly Ser Tyr Asp Ser Phe     Lys     #         60     -      Asn Phe Ser Arg Leu Asp Leu Glu - # Lys Phe Leu Tyr Phe Gln Thr     Leu     #     80     -      Ala Asn Leu Thr Asp Lys Leu Lys - # Ser Phe Ala His Thr Gln Ser     Leu     #   95     -      Ile Lys Asn Gln Ser Ile Phe Gly - # Ile Gln Ile Leu Lys Pro His     Asp     #                110     -      Phe Lys Phe Lys Ala Phe Phe Lys - # Asn Asn Ser Ala Ser Ser Val     Leu     #            125     -      Val Ile Cys Phe Tyr Gly Gly Gly - # Phe Pro Asn Tyr Lys Phe Ala     Ile     #        140     -      Ile Arg Arg Lys Gln Lys Gly Tyr - # Gly Leu Phe Gly Gly Lys Ile     Phe     #    160     -      Asp Leu Phe Ala Ile Gly Gly Tyr - # Leu Ser Leu Asp Phe Met Ser     Asn     #   175     -      Leu Thr Asn Arg Phe Lys Asn Phe - # Leu Ser Ser Phe Arg Tyr Ala     Ile     #                190     -      Val Lys Cys Phe Leu Leu Lys His - # Ala Leu Glu Ile Arg Thr Ser     Lys     #            205     -      Asn Leu His Asn Gln Lys Lys Asp - # Ser Trp Glu Lys Phe Leu Leu     Cys     #        220     -      Trp Val Ala Glu Gln Ile Phe Leu - # Gln Val Tyr Pro Val Ile Cys     Phe     #    240     -      Leu Tyr Ile Phe Cys Ser Phe Cys - # Trp Asp Leu Met Lys His Gly     Lys     #   255     -      Asn Leu Thr Cys Leu Trp Thr Ile - # Ser Arg Ile Leu Trp Gln Lys     Cys     #                270     -      Arg Met Asp Ser Ile Val Ser Lys - # Thr Trp Asp Leu Leu Trp Leu     Val     #            285     -      Met Val Leu Val Ala Ala Asn Gln - # Thr Lys Pro Leu Ile Lys Val     Ser     #        300     -      Leu Lys Asp Cys Ser Gly Val Leu - # Thr Ile Arg Asn Phe Phe Ile     Phe     #    320     -      His Ala Glu Leu Cys Leu Ile Asn - # Glu Ala Ile Asp Tyr Leu Asn     Ile     #   335     -      Met Ile Leu Leu Leu Lys Thr Pro - # Phe Lys Val Ile Leu Thr Leu     Tyr     #                350     -      Phe Ser Cys Asn Ala Phe Leu Gln - # Pro Ile Leu Leu Phe Val Asn     Tyr     #            365     -      Gln Arg Ser Asp Trp Ala Phe Ser - # Cys Phe Ile Ala Pro Leu Ala     Arg     #        380     -      Ser Asn Met Leu Pro Tyr Leu Glu - # Ile Thr Pro Leu Glu Glu Ala     Pro     #    400     -      Gln Ala Leu Met Arg Ser Ser Val - # Ala Ser Val Arg Ile Ile Pro     Thr     #   415     -      Leu Leu Leu Ser Ile Phe Ile Leu - # Cys Leu Ser Ser Ser Met Pro     Lys     #                430     -      Ala Phe Leu Ile Leu Thr Leu Ala - # Leu Ser Leu Trp Cys Val Ile     Leu     #            445     -      Gln Gly Gln Val Asn Leu Val Met - # Phe Ser Cys Phe Leu Pro     #        460     - (2) INFORMATION FOR SEQ ID NO:25:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 19932 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: DNA (genomic)     #ID NO:25:(xi) SEQUENCE DESCRIPTION: SEQ     - GGATATATTG AACCATTAAG TGCAAATGAT CTATATCGCT TCCATCGCAA TG - #ATAATGAA       60     - ATGACTTTGG GTGGCTATCT CATCTATAAC AGACTTAATA AAGTTATTCA AA - #TCCCCTTG      120     - CAACAACTTT TTTCTATACT TACACACTAA AATCAAATGG GCTTTTAGAT TA - #TGCTTGCT      180     - TCGGTTGGTT GAAATATACC CCCTTAATGG GTAATGGTTT TTTCTCATTC CT - #CTATCCTT      240     - ATTCATTATT TATAAAAACA TTGTATAATA ATACAAGATA AAGATAAGGA GT - #TATTTTTC      300     - TTAACGCTAT CAAGTTTAGA ATTTATCCTA ACGCTCAACA AAAAGAGCTT AT - #TTCTAAAC      360     - ATTTTGGCTG TTCTAGGGTC GTGTATAACT ACTTTTTAGA TTACCGACAA AA - #GCAATACG      420     - CAAAAGGCTT AAAGAAACTT ACTTCACCAT GCAAAAAGTC TTAACCCAAA TC - #AAGCACCA      480     - AGAAAAATAC CATTACCTCA ATGAATGCAA TTCTCAAAGC TTGCAAATGG CG - #TTAAGACA      540     - GCTTGTGAGT GCTTATGATA ATTTCTTTAG CAAAAGAGCG AGATACCCTA AA - #TTCAAATC      600     - TAAAAAAAAA GCTAAACAAT CTTTTGCAAT CCCCCAAAAC ATAGAAATCA AA - #ACAGAGAC      660     - TCAAACCATC GCTCTCCCTA AATTCAAAGA GGGCATTAAG GCTAAATTAC AC - #AGAGAATT      720     - GCCTAAAGAT AGCGTTATCA AACAGGCTTT TATTTCTTGC ATAGCCGGTC AA - #TATTTTTG      780     - TTCTATATCC TATGAAACCA AAGAGCCTAT CCCTAAACCT ACCATCATTA AA - #AAAGCGGT      840     - AGGTTTAGAC ATGGGCTTAA GAACGCTCAT TGTTACAAGC GATAAAATAG AA - #TACCCACA      900     - CATCCGTTTT TATCAAAAAT TAGAAAAGAA ACTCACTAAA GCGGAAAGGA GG - #TTAAGTAA      960     - AAAAGTAAAA GGCTCCAACA ACAGGAAAAA ACAAGCTAAA AAGGTAGCTA GA - #TTGCATCT     1020     - AGCTTGTTCA AACACTAGAG ATGACTACTT GCATAAAATC AGTAATGAGA TA - #ACCAATCA     1080     - ATACGATTTG ATAGGGGTAG AAACTTTGAA TGTTAAGGGG CTTATGAGAA CC - #TATCATTC     1140     - TAAAAGCCTT GCTAATGCGA GTTGGGGGAA ATTCCTTACT ATGCTAAAAT AC - #AAAGCCCA     1200     - AAGAAAAGCT AAAACCCTAT TAGGCATAGA CAGATTTTTC CCTAGCTCTC AA - #TTGTGTTC     1260     - TTATTGTGGG TTCAATACAG GCAAAAAACA TGAAAACATC ACTAAATTCA CT - #TGTCCTCA     1320     - TTGCAATATC ACACACCACA GAGATTACAA TGCGAGCGTC AATATTAGAA AC - #TACGCTTT     1380     - AGGCATGCTA GATGACAGGC ATAAAATAAA GATAGATAAA AGTAGGGTAG GG - #ATTATCCG     1440     - AACTGATTAC GCTCATTACA CTGATGAGCG CATCAAAGCT TGTGGAGCTT CC - #TCTAATGG     1500     - GGTTATTTCT AAATATGGCA ACATATTGGA TCTAGCTAGT TATGGAGCGA TG - #AAGCAAGA     1560     - AAAAGCCCAA TCGCTTTAGC GTTGGTAATT CACAAAAAGC AAGATCGGTT GT - #AAAAATGC     1620     - GTTACAACTA AAATAAAGGG TCAAGATAAC TCATTTTCAA AAAGGAGTCT TA - #AGTAATAA     1680     - AATCATAATG TTCAGCTAGT AATCTATTGC CTCGTTGATC AAACAAAGCT CT - #GCGTGAAA     1740     - GATGAAAAAA TTTCACCTTT AGATAGTTAA TACACCACTA CAGTCTTACT TG - #AGAGACAC     1800     - TCATTTTATT AGCGGTTTTG TCTGATTTGC TGCTACCAAA ACCATTACCA AC - #CAAAGCAG     1860     - ATCCCATGTT TTTGATACTA TCGAATCCAT TCTTCAGCAC TTCTGCCATA AA - #ATTCTTGA     1920     - TATTGTCCAT AGGCAAGTTA AATTTTTTCC CTAATGCTTC ATTAAGTCCC AT - #CATTAACA     1980     - TCAGAAAGAA CAAAAAATTT AATATCATAG AAAACAAATC ACTGGATAAA CC - #TGTAAAAA     2040     - GATTTGTTCC CCCACCCAAC AAAGAAGCTA AAATTTTTCC CATGATCAGT CC - #TTTTATTT     2100     - TTGGTTGTGT AAGTTCTTGC TTGTTCTTAT CTCTAATGCG TGTTTTAGTA GG - #AAGCATTT     2160     - CACAATAGCA TACCTAAAGC TACTAAGAAA ATTCTTGAAT CTATTGGTAA GA - #TTACTCAT     2220     - GAAATCAAGC GATAAGTAGC CACCAATCGC AAACAAATCA AATATTTTGC CA - #CCAAACAA     2280     - GCCATATCCT TTTTGTTTTT ATCTCCTAAT TATAGCAAAT TTTTATCAAT AT - #TAATTTGG     2340     - AAAACCACCA CCATATCAAA AACAAATTAC TAACACACTA GATGCAGAAT TA - #TTTTTTAA     2400     - AAACGCGCAC TTAAATTTAA AATCATGGGG TTTTAGGATT TGAATACCAA AA - #ATAGATTG     2460     - GTTTTTTCAA ATAAGCTAGC TTTGTGTATG CGCTTAAAAA GATTTTGGTT TT - #TAGTCAGT     2520     - AAGGTTTTAT GCTAATGTTT GGAAATAAAG AAATTTCTCT AAATCAAGTC TT - #GAGAAATT     2580     - TTTGAACGAA TCATAAGAAC CAATTTTGCC ATTGAGTCAT AAGTATGATT AG - #CTTCATTG     2640     - TGAATTTTGC GTGGCTTAAG AGATAGTATT TGCTTATTAT GCTGAGAGAA AC - #GAGTAGTA     2700     - AAAGATAAGT AGTGTAATAA AAAAAGCTAG GTTTTATTAT AAGAGCGAAT AA - #GAATAATA     2760     - TTGGATAAAC TAAAATCACC CCTGCCCCAT AAGAAAAAAG CCCTATTAAA AA - #ACCTATAA     2820     - CGATAGAGCT GATATTGAAC AGCCTATAAT AAAGGCTGTA CTTATCTAAA TG - #TTTGTTGA     2880     - AAGAATATTT GAATTGTAAG AAGTTTTGTT TTAATTTGCT AATTTGGTTG GT - #TCCATTTT     2940     - GGTTTTTAAA GAAATAGTTC AGGGCGGTGA ACTTATAAAG GAGCATAAAA TA - #ATAAATAT     3000     - TTTACAAAAC CACCCTAATC TAACTCCAAA TCTCAAAGAA TACCCCACTT GC - #TGATACTA     3060     - GCATGTGGTA TAGCACAAAC CAACGATTTG TTTGTTTATG CCAAACAAAG AA - #CAGAAAAC     3120     - ATCTGTTAAA GAATAAAGGA GTTTTCCATC TAAAAAACAC AAATCTATTA TA - #TAGAAATA     3180     - ATCTTAAGAG AAACTTAAAA AATACCAACA AGCCGCATAC AAGCAAGAAA AA - #CATAACAC     3240     - TATAAGACCT GGATTTTATT TACCTTTTGG ATATGGAAAA ATCTTGATTC AT - #AGTTTTGT     3300     - AAAAATTGTG GTAAAATGCA TTGATATTCT TTGAAATTTT AAGGTTACAA AA - #ACTATAAG     3360     - ATGCTTGCAA AAATTGTTTT TAGCTCATTG GTTGCGTTTG GAGTTTTGTC GG - #CTAATGTG     3420     - GAGCAGTTTG GTTCATTTTT CAACGAGATA AAAAAAGAAC AAGAAGAAGT GG - #CTGCAAAA     3480     - GAAGACGCTC TTAAGGCTCG CAAGAAGCTC TTAAACAATA CGCATGATTT CT - #TAGAAGAC     3540     - TTGATTTTTA GAAAACAAAA AATCAAAGAG CTTATGGATC ATAGAGCTAA AG - #TTCTTTCA     3600     - GACTTAGAAA ACAAATACAA AAAAGAAAAA GAGGCTCTAG AGAAAGAGAC AA - #GAGGTAAA     3660     - ATCCTTACTG CTAAGTCAAA GGCTTATGGG GATCTAGAGC AAGCCTTAAA AG - #ATAACCCT     3720     - CTCTATAAGA AACTTCTTCC TAACCCTTAT GCCTATGTTT TAAACCAAGA AA - #CATTCACC     3780     - AAAGAAGATA AGGAGCGTTT GAGTTATTAC TACCCCCAGG TGAAAACGAG CA - #GTATTTTT     3840     - GAAAAAACTA CCGCTACCAC TAAAGATAAG GCTCAGGCTT TGCTTCAAAT GG - #GTGTGTTT     3900     - TCTTTAGATG AAGAACAAAA CAAAAAAGCG AGCCGATTAG CTTTATCTTA CA - #AGCAAGCG     3960     - ATTGAAGAAT ATTCCAATAA CATTTCTAAT CTGTTGAGCA GAAAAGAATT GG - #ATAATATA     4020     - GATTATTACT TACAGCTTGA AAGAAACAAG TTTGACTCCA AAGCAAAAGA TA - #TTGCTCAA     4080     - AAGGCTACTA ACACGCTTAT TTTTAACTCG GAACGCTTGG CGTTTAGCAT GG - #CGATTGAT     4140     - AAGATTAATG AGAAATACTT AAAGGGCTAT GAGGCTTTTT CTAACTTGTT GA - #AAAATGTC     4200     - AAAGATGATG TGGAATTGAA TACTCTGACT AAAAACTTTA CCAATCAAAA AT - #TGAGTTTC     4260     - GCACAAAAAC AAAAATTGTG TTTGTTGGTT TTAGACAGCT TCAATTTTGA TA - #CCCAATCC     4320     - AAAAAATCTA TATTAAAAAA GACTAATGAA TACAATATTT TCGTAGATAG CG - #ATCCTATG     4380     - ATGAGCGACA AAACAACTAT GCAAAAAGAA CACTACAAGA TATTTAATTT CT - #TCAAAACA     4440     - GTGGTTTCTG CATACCGAAA CAATGTTGCC AAGAATAATC CCTTTGAATA GG - #AAAGGAGA     4500     - CACTCTTGAA AAGCATCTTC AAAAAACTAG GTTCTGTCGC TCTTTATTCT TT - #AGTTGTTT     4560     - ATGGGGGCTT AAACGCTATC AATACAGCAT TATTGCCGAG TGAATACAAA GA - #ATTAGTGG     4620     - CTTTGGGCTT TAAAAAAATC AAAACACTCT ATCAAAGACA TGATGACAAA GA - #AATTACAA     4680     - AAGAGGAAAA AGAATTCGCC ACTAACGCTT TGAGAGAAAA ATTACGAAAT GA - #TAGGGCGA     4740     - GAGCAGAGCA AATTCAAAAG AATATTGAAG CGTTTGAAAA AAAGAACAAC TC - #TTCTGTTC     4800     - AAAAAAAAGC GGCTAAGCAC AAAGGATTAC AAGAATTAAA CGAAATTAAC GC - #TAACCCTT     4860     - TGAATGACAA CCCTAATGGC AATTCTTCCA CTGAAACCAA ATCTAATAAA GA - #TGATAACT     4920     - TTGATGAGAT GATCAATAAG GTGAATGAAT CTTTTGTGAA ACCTGCTGCT CC - #GCTTGTGC     4980     - CTGATGAGTG GAGAACGCCT GAAATTGAAA TCATTATCAA TGAGTGTATT AT - #TTCAAGCA     5040     - ACGATTATGA TGGGTTAAGA AAGTGTTTGA TCAAAGACAT CAAGGATCAA AA - #AATTCTTG     5100     - CCCCCTTATT AGAAAAAATT CAAGAAATAG AGACAGAAAA TAACAAGTTT TC - #TAGACAAC     5160     - ACCTAAGTGG TTTAAAACTC ACTCTTAATA ACAGCAACAA TAGAACCTTT CT - #TATAGCTT     5220     - CGTGCGCTAT TTGTGAGAAG AGAAAAAAAG AAATGGAGCA AGAAAATAAC TA - #CCAGGATA     5280     - CTACAAATGC AAGCGAGTTT GGCACTACTG ATACAAAAGA AAATGAAGCA AA - #AGATACAG     5340     - CATTCTCAAA CAATCGCTCT AAATCCGAAC TGCCCAATAG CGTCATTAAT CA - #AATAGAAC     5400     - AAAGCATCGC TCATGGAAAA AAATAGCGAT CCAAATTATT AGATCAAAAA AC - #AACTAGAG     5460     - AAGCAAATCC CAAAGGTTAG AAATCATAGC CTATCATCTC AGAAAAATCA TT - #TAACAATG     5520     - ATCTTACTTG ATTGCCTTTC TTGTAGGTAT TGTCGCTTAC TTTGTTCTAG GG - #ATCTTTCT     5580     - AATGCGTCCA ACTCCTCTAA ATAATTTAAA AAGACCTTGT TTTGAGCTAA CA - #TAAGCTTT     5640     - CTGATTCCTT TGATGAAATT TTTATTCTTT AGGCTTTCTA CAAGCGTCTG TG - #AAGCAGTG     5700     - ATTAAAGAAG CTGTACCTCC AATGTTGCTC TGATACGCCT TTAGGGAAGT TT - #CTAAACGC     5760     - TCTCTTATAT TTTGTTTTTC TTGCTCGATT TTCAGCTTCC CTTCACAATA AA - #GAACTAAA     5820     - ACTTTATCGG ATATTCCGCA TTGCTGCTCA GCAGTATTTT GGTCTAAGGG AT - #TGATTTTC     5880     - ATATAGGTTA ATAAAAGTTC AGGGCTAGAC ATATAAGTCT TGAAAATCAC AT - #CTTCTGAG     5940     - ATGAAAAATA ACTCATTCGC TTCAAAATTG GCTTTCAATA ACGCTAAATC TC - #CTCTCAAA     6000     - GCAATGGCCG CTTTTTTGAT GTTTAGAGCA TCTTCTTGAC CTATTTCATT AT - #TAGCGCTA     6060     - GGGCTAGTGG TTGAAAAAAT CTCATCTAAG TTTTTAAGCA CTTGTTGGTT GG - #TCTCTTGG     6120     - TAGGTGCTAT CAAGTTGCTT TAAACCGCTT GTTATATCTT CTCCCATCAA AA - #CAGACAAT     6180     - AGCAAAAAAG AAGATATGGT ATTTTTCACG AGTGTTTTCA TTTGACAATA AC - #TTTAGAGC     6240     - TAGCAATGTT TCTTGCTGTC GTTTCTCTTT CTAATTTCAG TTGTTCTTCC CA - #AAGGTCGG     6300     - CTTTTTTTTC AAGATTCTCT ATATAGTTTA AATGATTTTC TGCGTTTAAG AT - #CGCAACTT     6360     - CTATGAGCGC ATTCAAATCT ACTGATCCTT TTAAGGTTTT GATTTCTCCA TT - #GATCCCAT     6420     - TCAAATAAGC GATATTTTGA AAATCTGCAT CACTCAGTTT ATTTTGAATA AG - #GGCTACAA     6480     - TCATTCTGTA ATTCTGAATA ACCTGTTCCA TAAGGCATGC TGAAATTTTT AG - #CCCATCAA     6540     - GATAAGGGCA TTTTGTGGGC GCTAGAGTGA ATGTTTCAAT GATTCCAAAT GG - #TCGCCCAT     6600     - GCTTGAAAAA AAACTAAGAG CAGGCGCATA GATGGCACTT TGAAACAAAG CC - #TGACCTGT     6660     - TAGGGAATTA TAATCAATAA GGGTCGCTTT TTGCATAGCT GTTTTCAACC AT - #GTCTCAAA     6720     - ACCTTTTAAG GTTTCTTCAA ACGCCTTGAT ACCAATCGTA TTGTAAGCGA TG - #TATTGAGC     6780     - GTTGTCAGAA GAACTTCCTA GAGCTTGAGA AATTTCCATT TGTGTTTTTA GG - #GTAACCCT     6840     - CGGTTCAAAG CTGTTTTTTA ACGCTTCTAA GAGAGCGTTT TGCTGGTTCA TT - #TTGAGCTT     6900     - GATCATTTCG TTATTTTTTT GGAGCGCGAT TTGCATGTTT TGGATTTCTG TT - #TGGGTATT     6960     - AATTTTTTGT TTTTCCACGA TCATTTTGAC ATTCCCCCCC AATGCACTAA GC - #GCCGCTTG     7020     - AATACCCTTC CATGACGCCA AGCAAGATGT CTGAACCTGC AAAAAACCCC CC - #TGTCATGC     7080     - CATTGACACC ATTAATAACG CCATTAGCCC CTTTTAACAT AGCGCTCATG GT - #TGCAAGCT     7140     - GAGTCCTCAA TTCTCCCTCT ATTTGCGCTT GAATGGCTTT TTCTTTGGCA CT - #AGATTGAG     7200     - CTTCTATGGC TTTTAATTCG GCGTGAGCGG TTTTTTGTTT GGCTTGTGCG TC - #TGCCTGAA     7260     - TGGCTTTTAA GGCAGGTTCA AGCGTTATTA CTACCTCTGT ACCATTCAGA GA - #CAAACCAC     7320     - AAAAAGTCAA GAAAGAAAAT ATGCTTAAAA AACATTTCAC ATCTCTTTCC TC - #ACTTCACG     7380     - ATTATTTTAG TTTGCACCCT TTCTGTTAAG TAGCTATCTT TTTGCCCCTT AA - #GCTTGTCT     7440     - TTGATGTAAT CAAGGTAAGT CAAATGCGAT TTCAAAAAAG ATTTATTCGC TA - #CTATATTG     7500     - TAATTATATA GCGAACTTAT GTTAGAAATC GCTTGAGTGT CATAGGTGCT AG - #TAGCTAAT     7560     - CCTGATTGAT TAAGTATCAT TTGAGAAGCG TTCTGCAACA AATTGGTATT AT - #TTTTCACA     7620     - AATTCTATAT AGTATTCTCT CAAAATTTCT GCTACTTTTT CAGCATAGCA AT - #AAACAGCA     7680     - AGAACCTTGT CCCCAATAGG GCATGCAGGA GTGGTTATAG GATTAACGCC TG - #AAGTTAGG     7740     - GCATTAGTGC GTAACGCTTG GTATTTAGCA TAAACAGTGG GCATAGAAAC GC - #TCATGGGG     7800     - CGTCATAGAA ATTTGCATGC AACTGAAAAA CACTTTTGAT GAGCCAACAA GC - #GCACCTAA     7860     - AGCGGTACAG CTATCAAGGA ATCGGTGTAT CATTCATTGA GCTGTTGCTT GC - #TTGAGAAG     7920     - CCAGTTGCTC TTGTAGAGCT AGGGCGTATT TTGGTGCTGC ACTTGTAATA TT - #GCCTAATA     7980     - TACCGTCATC ATTTCAACCG TTGTTGGCAC GCTAGGAACA GCGATTTGAT TT - #GTCGCATA     8040     - AGCTTCAATA GCACTGGGAT TTTTAGGGGT GGTGTTACTC GCTAAAATGC TT - #GCAATCTG     8100     - ACTATTAACA GCACCAATTT GCGCGCCTTG GCTGTTGCCT TGAGCGTTAA AT - #TCCCCTGT     8160     - TAATTTGCTA ATATTTAAGA TATTGTTCCC CACAGCCATG CTTTGATCGT TA - #AAACCTTG     8220     - ATACAATTGG TTGTATTGTT GGTTAGCGGC TTTCATAGGC ATGCTTACGG CT - #TCAGCGAT     8280     - GCTTTGATTG TATTGGGTCA TGATAGCGGT CATTTGCGGA TTAGTAAACC CA - #ACAATAAT     8340     - AGGAATAATC GCTGCTGTCA TAGCACCCGC TACTATTCCT GCAAATGGTC CT - #GCGACACC     8400     - ACTTGTGTTG AGATGATTGA GGAAACTTCC GATAAGAAGC CTGCAGAAGA TG - #ATTCATAT     8460     - ATAGCTTGTG TACCTGCCAT GTTAACACCC CCTAGTTAAT ACCCTAATAT CG - #GTGGTAAA     8520     - AACGATGAAT CTGAGTATGT TGGTGCATAA CCATACATGA AAGGATTGTT TG - #GACCGTAA     8580     - TCGCCCATCA TTTGGCTCAT GAGAAGATTT TGAATGCCCC ACATCGCATT GA - #TACCTAGA     8640     - TTATCATTAG GTTGAAAACT CCCTAAACTT ATGTCGTCAA ATTTGATATT AA - #CATTTTTA     8700     - TCATTATAGT CATTGAGTAT GGCCACTTTT TGCTCTAGGG TTTCTTTAGG GA - #TCTCTATT     8760     - TTTAGTTGAT CTCTAGAAAC AAGCCCCACG CTATTTAGTG CCATATCTTC AG - #GACTAATA     8820     - TCTTTTATAT CAGTGTTTTG GTCAGCGTTA ACGGACTGTA AACATGCCAA TG - #ATAAGACA     8880     - CCAAGCAAAT AGTAATTTAA TTTTATAAAA ATCCGTTTTC ATACTTTTGA CT - #CCTTTATT     8940     - CTTATTTTTA GCACTATTCT AGCGCATTAA CGCCACTCAA TCGTTATTTT TG - #TTTTGATT     9000     - TTTTTGATCG AGCATTTTGT TTGTTACTTC ATCAATGTTT TGAAAATATT TT - #TCAAAAAG     9060     - CTCTTTCTTT TTAGCTTCAA CGCTCATATC AATCTGAATC CAATTAGGAA TA - #ATGGAGTC     9120     - CATGATTAAA TGCATGAAGT CATAGGCATG ATTTTTTGGG TATATTTTGT TC - #TGAACATA     9180     - GTATTCTAAA AAATTCGCTT GAACAAAAAA AATCTCTATA TCGCTCTGCA TA - #TCCTCGCT     9240     - TATGTTGTTA TTGATAGGTT TTTCTAGTAA TCTGAGAATC CTATACGACT GC - #ATGATAGT     9300     - TTCTAGCATG AAGTAAGCAT AAACATAAAC TAATGAGACA AAAGAATTGT TT - #GCTTTAAA     9360     - CGAGCTTGCG TCATTTTTCC CACTTTGAAG AGGAATTAAT CTTGATAATG TT - #TTTTGGGG     9420     - ATCTTGCCCA TCGTTTATTT CTTTAAAAAA GCGGAAATCT AAATCCTGAT TA - #CTGAGAAA     9480     - TGACTTGACA AAGTGAAGAT TAGCATTGAG ACTATCTATG AGACCTGAAT AA - #AGGTGCTC     9540     - TGTTTTGACA TCATCTATAT TTAAAACATT CTCATAAAAC ACATTGACAT GG - #TCTTCTAA     9600     - GAAATTAGAA AAGTCATAAA GAGTGGTAAG GTTTTGTTCA GTGATTTCGC CT - #TCCATTTC     9660     - TTCTTCTATG AAGTCCAATT CTTCTTTCAG TTCAAAAAGA TAATTAGAAA AA - #CTATCCAA     9720     - AATCGTCAAG ACATCATTTT CAAAATTTCC AATAATTTTT GTTCACGCAA AT - #TTTGTTTC     9780     - ATTTTAATAC TCCTCTATTT GTTGATACAT TTGTCTCAAG GCCTGATATT TA - #TCTATGAT     9840     - ACTATGGTTT TGGATAATCT TATCAATTTC TTTGACAAAT ACAGTATCTG TG - #GATAAAAT     9900     - TTTCAAATAT TCTTTAGGAA TGCCTCTCAA ATTAAAACTA GCGATAACGC TA - #GGGCTTCC     9960     - ATCCTGTTTG TAGAGGATTT TCCTATCTAG TCCCTTAGTG ATGATTTCAA AT - #TCTTTTTC     10020     - TGTAACATTA GCCAATCTTT GGTAATCAGA AAGATTGCCC CCATCGTTTC TC - #AAAAAAAT     10080     - CTTTGTAGGG CATTGTTCTC TAATCGTATC AGCAATAGGG CAAGCCAAAA GA - #TCAGTGAT     10140     - GCTTTGAGTC GCAAGTCTGA CAATAGCGTT TCTTTTCCTT GCAGTTTTTA GC - #ATGTCTCT     10200     - TACAAAATAA GCGACCTTTG GATCGCCTAA ATATTTCCAG GCTTCATCAA TA - #TCTAAGAC     10260     - AAATCTACGC CCATCCATTG CCTCTTGGAT ACGAGCGAAA AGGTAAAAAC AA - #ATAAAGGG     10320     - CGAAACATCA TTATTGTCTA AGAAACTTGA CCCATCAACG CCAATAATCG TT - #TTTGAAAA     10380     - ATCTAAGCGA TCTGTTGCTT TATTATCAAA AAGCCATTGA AATTCACCAT TG - #GTTGATTT     10440     - GCAAAAAGGC GCTAATCGCG CGACAAGCCC ATTAGGATCA TTGTGGTCTT TC - #CCGAAAGC     10500     - ATTAATAAGT TGAGTGATGG GATAATCTAG ATTCATATTT CCTGTGATAA GG - #TTGGTTAC     10560     - TGCGCTGCAA GCGTATTAGA ATCTGCTAGG CTAAAAGAGA TGCTGTTGCC AT - #TTTCATCT     10620     - TTTTCATCGC TTTTAGTTGC TAAGTTTTTC ACAAGCTCTT TGACAACAGA AA - #TAGCTGTT     10680     - TGTTTTTGCT CCATTGTTGC ATTTGTTTTT TGCACACAAG CCGCCCAAGC AA - #AAGGATTT     10740     - AATCCTGTAT CTGTCCCTAG CTCAATCTTG ACATACTCCC CACCCATTGC GA - #CAATATTC     10800     - CCATAAGCGC CATAATCTTT ATCCATATAA ACCATAGTGA GCTTTTGCTT GT - #CTTTGCTG     10860     - ACATTAGCAG GAAAATTATA GGCAAATTGT CCCATAGCGT TCAAGGTCAT TG - #ACATAAAC     10920     - ACTGTCTTAC CTGAACCGGT TGAGCCAAGT ATCAAAGTGT GTCCTGCTGA AG - #CTGAACCA     10980     - AAATCAGTGG GCATGTGGAA GTTCAGATAA AAAGGCGAAT TGATCTCGCT TT - #TTAGCGTC     11040     - ATCACACTAT TGCCCCAAGC GTTATTCTCT TGATTGCCAT CAAAACTCAT AG - #CCCTCATA     11100     - GCGATGAAAT CAGCAAAATT ATTAGAAGTT ACATCAAAAA TAAAAGGAAG CG - #TGATAAAA     11160     - GAGCAATGTT TGGCAAAAAA GTAATTTTCC ATAGAGAAAG TCGCTGCGTT GG - #CTAAAAAA     11220     - CCTTTAGCGT TAAGACTAGA GACGCATTCC TTAACGCTTT GTTTCATTTT TT - #CAAAGCTA     11280     - TCAGCAAACA GCACTAAAGA ATTACCATAA CTGCCTAGCG TAATATCACC AT - #TACCCACT     11340     - AATTCGCTCA AGCAACCTAA AGTCATGCCC TGTTCTTTAG AGCCTCCACT AA - #TAATAATT     11400     - CTTCTAGAGG TGAAAGCCAG TTTGTCCTTT AAAACCTGTG AGTTTTTAGG CG - #AATAAGCA     11460     - TGCATGAAAA TAAATTCGCT GTCTAGGGCG TTGATTTTAT CAAACAAATC GC - #TTTGTGAT     11520     - TTAGGGGCGT ATTCACTAAT CTCAATAGCG CTAAAATATT TTTCACTCAA AT - #CGTCATTT     11580     - AAGATTTTTC CATGCTTATT GGCAAAATAA ACTTCTTTCA CCCCACCATG CA - #TTTTTTCC     11640     - TTGAGATACA AGTCTTTTCG GTTGCAAATA AAAGGGGCTT CATTCATTCC CA - #CAAGAAAA     11700     - TTGTAAAATT CGCATTGTTT GGAGTAAATA ACGCCATCTT TAGTGTATTC TT - #TTAATCTA     11760     - GTGGGGTGGT ATTTGCTCAA CAGCTCTTCT ATGAGCTCTA TCCTATCCTT GA - #AGTTTTCA     11820     - AGCTTGGCTC TAATAATCCT TTGAAACTCT TCAAAATTAT TGTCTGCAAA AT - #GCTTTTTA     11880     - TTCATAACGG GTTCATTGAG AGTGTCTAAT AAATCTTGCT CTATGGTCAG AA - #AAAAACTA     11940     - ATATCATAAA AACTTTCTCT CTTTTGCTTC TCATTATAGG CTCGCATGAA AT - #CATTAGAA     12000     - AAAATAAGAC CATAGTCCCT ATTGGTTTCA TCAATAACGA TTTTCTTTTT AA - #TAGTGTGA     12060     - AAATAGAATT TGAATTCAGG GGTAACAAAA TTCCTAAAAA CGCTATAAAT AG - #AAGCGTGT     12120     - AACTCTATGA GATCTTTTTT GGAAGTGGTT AAAAAATCAA TGCCCCCCAA TT - #TGATTGTG     12180     - CCTAAAAGAG AATAGTTGTT AGTAAGGATC ACCCCATCAT CTAAAAAACA TT - #CATAGTTA     12240     - TTTGCTAGAT AGGAGTTTGC AGCGCTCACA AGTCTGTCTT CTCTGTTTGG AT - #TTAAGTGG     12300     - ATGTCATTAG CCATTTCTTT ACTAGGCTTC ATGGAAAAAA TGCTCATGAA CG - #CTTTGTTT     12360     - TTCACGCCCT TAAACAAAAA AGGTTTTTTA AATTTCATCG CTCGCTCCAT TC - #TTTGATAA     12420     - AGCCTATAAT CTTTCTTGAA TCCAAGAGCT ACAAGCACAA TAACAATCGC TA - #CAATCAAA     12480     - ACAGGTTCAT AGGCTTGAAA AAGAATAACA GATAATACAA TGGTTACAAA CA - #ATATAAAT     12540     - ATAGAGGAAT AAATAAAAGT TTCAGGGAAA CCAAACAACC TATTCCCCCC AT - #CAAACAAG     12600     - ACTTTAAAAA AGGGATTGAC ACCCTTTTGC ATGTCTGCTT TAAGTTCTTC TA - #TTTTTTGA     12660     - AACTGCCGCT TTTGAACCTC TTGCTCTATA ATTAGCTTTT TTTGTTCATC AG - #CCTGCTTG     12720     - CTTGCCACAA ACACCTCTCT CTTTATAGAT ATACCGCTTC ACATGTAATC GT - #ATAAAAGA     12780     - TTTTTTTGAG AGACTCTACG GTGCTAATAT GTTTCAAAAG ATCATTAGGA TC - #ATAAGAAT     12840     - TGAATACGGC CAATAAAACA TTATATAACT TATCATCGCA TAGAATTTCT CT - #TGTTTCTC     12900     - CGCGCAATGA CAGAAAGCAG CGTTGTTTGT TGGTCGTGCT GATGCTTTTG AA - #AGTAAAAA     12960     - AGTCTTTCAC TTCAGGATTG ATCTGTAATT CTACATTCAA TCCCATTTCC TT - #ACCCTTTT     13020     - CATCAAAGAT TTTTTCAATA ACTGGATCGT AATGCTTCAA ATCCTTTATT TT - #TTTAAGGA     13080     - CTCTATTGAC AATCACGAAG TCAAAAACTT CATCTTTGAT AATATCGGGA TT - #GACTTCTT     13140     - TGAAAGTTAC TTTCTTGTCT TTCAAATTTT TGATAGTCGC TTTGAAACTA TC - #AAAATCTA     13200     - AATTTGTATA AACAAGCCCA TTGGGAGTGT TTTTTTCTTT TTCTTGTGCT TC - #TTTTTTGG     13260     - CTTCTTTGTC ATCATTTGCT AACCCATACG AACTGAAAAC AACGAGACTT AA - #GAGAACTT     13320     - TCAAAAAAAA GCCTCTTAGT TTCTTATTGC TATTATTATT ATTGTTGATC AA - #CTTAGCTA     13380     - GCTCCTCCAC CCTCGCCAAT ATTGAAGCCA AACTTAGTGC TCAAATAGAT AA - #TACCGCCT     13440     - GCCACCGCTA ACATAGCTAT GGGTTGCGCG TAACGAAAAA CAGTCGCCTG AC - #CTCTTTTA     13500     - ATGTCATCAG AGATTTTCCA AATATCCGCT ATGCCTTTGA CCCCTAAAGC GC - #AACCACCT     13560     - ACGATCGCTA GAACAGAAAT GATCTGAATA ACCAAACCTT TAGTTGCAGT GA - #CGCCTTCT     13620     - GTAGGACTGG CGACCGCATT TAAAGGATTG GTTGTTACCA CTAGCCCTAA AG - #TTACTACA     13680     - ACTTTCTTGT AGCTGTCAGT GATTCTTGTA AAAAATTTCA TGCGTTTCCT TT - #CAAATTGA     13740     - AATCAATCGC TTGAGTATAT CAAAAAAAAA AGTATTTTTA TACTATTCAT AC - #AAGCGCTA     13800     - CTTTATAATT TAAATCAAAA CCGACGCTTT TGCTCGGCAA CTGACATCAT TC - #AGGAATAG     13860     - TAAACCTACT TGTCCCAACC ATTTTTCTTT CTCAAGTCGT TGTAGAATTG TA - #GATCTTTA     13920     - GGATCTTTGA TGTATTTTTT AATCGTCTCA GGTTGAAACC TAAAAACAAG CA - #AAAACAAA     13980     - CCCAAGCTGA TCAGAGTGAG AATAAAGCTC CATTTTAAGC AACTCCATAG AC - #CACTAAAG     14040     - AAACTTTTTT TGAGGCTATC TTTGAAAATC TGTCCTATTG ATTTGTTTTC CA - #TTTTGTTT     14100     - CCCATGTGGA TCTTGTGGAT CACAAACGCT TAATTATACA TGCTATAGTA AG - #CATGACAC     14160     - ACAAACCAAA CTATTTTTAG AACGCTTCAT GTGCTCACCT TGACTAACCA TT - #TCTCCAAC     14220     - CATACTTTAG CGTTGCATTT GATTTCTTCA AAAAGATTCA TTTCTTATTT CT - #TGTTCTTA     14280     - TTAAAGTTCT TTCATTTTAG CAAATTTTTG TTAATTGTGG GTAAAAATGT GA - #ATCGTCCT     14340     - AGCCTTTAGA CGCCTGCAAC GATCGGGCTT TTTTCAATAT TAATAATGAT TA - #ATGAAAAA     14400     - AAAAAAAAAT GCTTGATATT GTTGTATAAT GAGAATGTTC AAAGACATGA AT - #TGACTACT     14460     - CAAGCGTGTA GCGATTTTTA GCAGTCTTTG ACACTAACAA GATACCGATA GG - #TATGAAAC     14520     - TAGGTATAGT AAGGAGAAAC AATGACTAAC GAAACCATTG ACCAACAACC AC - #AAACCGAA     14580     - GCGGCTTTTA ACCCGCAGCA ATTTATCAAT AATCTTCAAG TAGCTTTTCT TA - #AAGTTGAT     14640     - AACGCTGTCG CTTCATACGA TCCTGATCAA AAACCAATCG TTGATAAGAA CG - #ATAGGGAT     14700     - AACAGGCAAG CTTTTGAAGG AATCTCGCAA TTAAGGGAAG AATACTCCAA TA - #AAGCGATC     14760     - AAAAATCCTA CCAAAAAGAA TCAGTATTTT TCAGACTTTA TCAATAAGAG CA - #ATGATTTA     14820     - ATCAACAAAG ACAATCTCAT TGATGTAGAA TCTTCCACAA AGAGCTTTCA GA - #AATTTGGG     14880     - GATCAGCGTT ACCGAATTTT CACAAGTTGG GTGTCCCATC AAAACGATCC GT - #CTAAAATC     14940     - AACACCCGAT CGATCCGAAA TTTTATGGAA AATATCATAC AACCCCCTAT CC - #TTGATGAT     15000     - AAAGAGAAAG CGGAGTTTTT GAAATCTGCC AAACAATCTT TTGCAGGAAT CA - #TTATAGGG     15060     - AATCAAATCC GAACGGATCA AAAGTTCATG GGCGTGTTTG ATGAGTCCTT GA - #AAGAAAGG     15120     - CAAGAAGCAG AAAAAAATGG AGAGCCTACT GGTGGGGATT GGTTGGATAT TT - #TTCTCTCA     15180     - TTTATATTTG ACAAAAAACA ATCTTCTGAT GTCAAAGAAG CAATCAATCA AG - #AACCAGTT     15240     - CCCCATGTCC AACCAGATAT AGCCACTACC ACCACCGACA TACAAGGCTT AC - #CGCCTGAA     15300     - GCTAGAGATT TACTTGATGA AAGGGGTAAT TTTTCTAAAT TCACTCTTGG CG - #ATATGGAA     15360     - ATGTTAGATG TTGAGGGAGT CGCTGACATT GATCCCAATT ACAAGTTCAA TC - #AATTATTG     15420     - ATTCACAATA ACGCTCTGTC TTCTGTGTTA ATGGGGAGTC ATAATGGCAT AG - #AACCTGAA     15480     - AAAGTTTCAT TGTTGTATGG GGGCAATGGT GGTCCTGGAG CTAGGCATGA TT - #GGAACGCC     15540     - ACCGTTGGTT ATAAAGACCA ACAAGGCAAC AATGTGGCTA CAATAATTAA TG - #TGCATATG     15600     - AAAAACGGCA GTGGCTTAGT CATAGCAGGT GGTGAGAAAG GGATTAACAA CC - #CTAGTTTT     15660     - TATCTCTACA AAGAAGACCA ACTCACAGGC TCACAACGAG CATTAAGTCA AG - #AAGAGATC     15720     - CAAAACAAAA TAGATTTCAT GGAATTTCTT GCACAAAATA ATGCTAAATT AG - #ACAACTTG     15780     - AGCGAGAAAG AGAAGGAAAA ATTCCGAACT GAGATTAAAG ATTTCCAAAA AG - #ACTCTAAG     15840     - GCTTATTTAG ACGCCCTAGG GAATGATCGT ATTGCTTTTG TTTCTAAAAA AG - #ACACAAAA     15900     - CATTCAGCTT TAATTACTGA GTTTGGTAAT GGGGATTTGA GCTACACTCT CA - #AAGATTAT     15960     - GGGAAAAAAG CAGATAAAGC TTTAGATAGG GAGAAAAATG TTACTCTTCA AG - #GTAGCCTA     16020     - AAACATGATG GCGTGATGTT TGTTGATTAT TCTAATTTCA AATACACCAA CG - #CCTCCAAG     16080     - AATCCCAATA AGGGTGTAGG CGTTACGAAT GGCGTTTCCC ATTTAGAAGT AG - #GCTTTAAC     16140     - AAGGTAGCTA TCTTTAATTT GCCTGATTTA AATAATCTCG CTATCACTAG TT - #TCGTAAGG     16200     - CGGAATTTAG AGGATAAACT AACCACTAAA GGATTGTCCC CACAAGAAGC TA - #ATAAGCTT     16260     - ATCAAAGATT TTTTGAGCAG CAACAAAGAA TTGGTTGGAA AAACTTTAAA CT - #TCAATAAA     16320     - GCTGTAGCTG ACGCTAAAAA CACAGGCAAT TATGATGAAG TGAAAAAAGC TC - #AGAAAGAT     16380     - CTTGAAAAAT CTCTAAGGAA ACGAGAGCAT TTAGAGAAAG AAGTAGAGAA AA - #AATTGGAG     16440     - AGCAAAAGCG GCAACAAAAA TAAAATGGAA GCAAAAGCTC AAGCTAACAG CC - #AAAAAGAT     16500     - GAGATTTTTG CGTTGATCAA TAAAGAGGCT AATAGAGACG CAAGAGCAAT CG - #CTTACGCT     16560     - CAGAATCTTA AAGGCATCAA AAGGGAATTG TCTGATAAAC TTGAAAATGT CA - #ACAAGAAT     16620     - TTGAAAGACT TTGATAAATC TTTTGATGAA TTCAAAAATG GCAAAAATAA GG - #ATTTCAGC     16680     - AAGGCAGAAG AAACACTAAA AGCCCTTAAA GGTTCGGTGA AAGATTTAGG TA - #TCAATCCA     16740     - GAATGGATTT CAAAAGTTGA AAACCTTAAT GCAGCTTTGA ATGAATTCAA AA - #ATGGCAAA     16800     - AATAAGGATT TCAGCAAGGT AACGCAAGCA AAAAGCGACC TTGAAAATTC CG - #TTAAAGAT     16860     - GTGATCATCA ATCAAAAGGT AACGGATAAA GTTGATAATC TCAATCAAGC GG - #TATCAGTG     16920     - GCTAAAGCAA CGGGTGATTT CAGTAGGGTA GAGCAAGCGT TAGCCGATCT CA - #AAAATTTC     16980     - TCAAAGGAGC AATTGGCCCA ACAAGCTCAA AAAAATGAAA GTCTCAATGC TA - #GAAAAAAA     17040     - TCTGAAATAT ATCAATCCGT TAAGAATGGT GTGAATGGAA CCCTAGTCGG TA - #ATGGGTTA     17100     - TCTCAAGCAG AAGCCACAAC TCTTTCTAAA AACTTTTCGG ACATCAAGAA AG - #AGTTGAAT     17160     - GCAAAACTTG GAAATTTCAA TAACAATAAC AATAATGGAC TCAAAAACGA AC - #CCATTTAT     17220     - GCTAAAGTTA ATAAAAAGAA AGCAGGGCAA GCAGCTAGCC TTGAAGAACC CA - #TTTACGCT     17280     - CAAGTTGCTA AAAAGGTAAA TGCAAAAATT GACCGACTCA ATCAAATAGC AA - #GTGGTTTG     17340     - GGTGTTGTAG GGCAAGCAGC GGGCTTCCCT TTGAAAAGGC ATGATAAAGT TG - #ATGATCTC     17400     - AGTAAGGTAG GGCTTTCAAG GAATCAAGAA TTGGCTCAGA AAATTGACAA TC - #TCAATCAA     17460     - GCGGTATCAG AAGCTAAAGC AGGTTTTTTT GGCAATCTAG AGCAAACGAT AG - #ACAAGCTC     17520     - AAAGATTCTA CAAAACACAA TCCCATGAAT CTATGGGTTG AAAGTGCAAA AA - #AAGTACCT     17580     - GCTAGTTTGT CAGCGAAACT AGACAATTAC GCTACTAACA GCCACATACG CA - #TTAATAGC     17640     - AATATCAAAA ATGGAGCAAT CAATGAAAAA GCGACCGGCA TGCTAACGCA AA - #AAAACCCT     17700     - GAGTGGCTCA AGCTCGTGAA TGATAAGATA GTTGCGCATA ATGTAGGAAG CG - #TTCCTTTG     17760     - TCAGAGTATG ATAAAATTGG CTTCAACCAG AAGAATATGA AAGATTATTC TG - #ATTCGTTC     17820     - AAGTTTTCCA CCAAGTTGAA CAATGCTGTA AAAGACACTA ATTCTGGCTT TA - #CGCAATTT     17880     - TTAACCAATG CATTTTCTAC AGCATCTTAT TACTGCTTGG CGAGAGAAAA TG - #CGGAGCAT     17940     - GGAATCAAGA ACGTTAATAC AAAAGGTGGT TTCCAAAAAT CTTAAAGGAT TA - #AGGAATAC     18000     - CAAAAACGCA AAAACCACCC CTTGCTAAAA GCGAGGGGTT TTTTAATACT CC - #TTAGCAGA     18060     - AATCCCAATC GTCTTTAGTA TTTGGGATGA ATGCTACCAA TTCATGGTAT CA - #TATCCCCA     18120     - TACATTCGTA TCTAGCGTAG GAAGTGTGCA AAGTTACGCC TTTGGAGATA TG - #ATGTGTGA     18180     - GACCTGTAGG GAATGCGTTG GAGCTCAAAC TCTGTAAAAT CCCTATTATA GG - #GACACAGA     18240     - GTGAGAACCA AACTCTCCCT ACGGGCAACA TCAGCCTAGG AAGCCCAATC GT - #CTTTAGCG     18300     - GTTGGGCACT TCACCTTAAA ATATCCCGAC AGACACTAAC GAAAGGCTTT GT - #TCTTTAAA     18360     - GTCTGCATGG ATATTTCCTA CCCCAAAAAG ACTTAACCCT TTGCTTAAAA TT - #AAGTTTGA     18420     - TTGTGCTAGT GGGTTCGTGC TATAGTGCGA AAATTAATTA AGGGTTATAA AG - #AGAGCATA     18480     - AACTAGAAAA AACAAGTAGC TATAACAAAG ATCAAGTTCA AAAAATCATA GA - #GCTTTTAG     18540     - AGCAAATTGA TCGCGCTCTT AACCAAAGAA AAATCAGAAA AACCATAGGA AT - #TATCACAC     18600     - CTTATAATGC CCAAAAAAGA CGCTTGCGAT CAGAAGTGGA AAAATACGGC TT - #CAAGAATT     18660     - TTGATGAGCT CAAAATAGAC ACTGTGGATG CCTTTCAAGG TGAAGAGGCA GA - #TATTATTA     18720     - TTTATTCCAC CGTGAAAACT TGTGGTAATC TTTCTTTCTT GCTAGATTCT AA - #ACGCTTGA     18780     - ATGTGGCTAT TTCTAGGGCA AAAGAAAATC TCATTTTTGT GGGTAAAAAG TC - #TTTCTTTG     18840     - AGAATTTATG AAGCGATGAG AAGAATATCT TTAGCGCTAT TTTGCAAGTC TG - #TAGATAGG     18900     - TAATCTTTTC CAAAGATAAT CATTAGACAT TCTTCGCTTC AAAACGCTTT CA - #TAAATCTC     18960     - TCTAAAGCGC TTTATAATCA ACACAATACC CTTATAGTGT GAGCTATAGC CC - #CTTTTTGG     19020     - GAATTGAGTT ATTTTGACTT TAAATTTTTA TTAGCGTTAC AATTTGAGCC AT - #TCTTTAGC     19080     - TTGTTTTTCT AGCCAGATCA CATCGCCGCT CGCATGAAAT TCCACTTTAG GG - #AATGCGTG     19140     - TGCATTTTTT TTAAGGGCGT ATTTTTGCTG CAAATATCCT ACAATAGCAT CG - #CCCGAATG     19200     - GATGAGTAGG GGGGGTGTTG AAAGGGCAAA ATGCTCCATA AAATAGCCCT CA - #ATTTTTTG     19260     - AGCGATTAAG GGAAAATGCG TGCAACCTAA AATAATCACT TCGGGAAAAT CT - #TTAAGGGA     19320     - GTGAAATAAT AACGCATGCA AGTTTCTAAC AATTCGCCCT CTAAAATACT TT - #CTTCAATC     19380     - AAAGGCACAA AAAGAGAAGT GGCTAAATGC GAAACATTCA AATAGCCTTG TT - #GTTTCAGG     19440     - GCATTGTCAT AAGCGTTGGA TTGGATCGTC GCTTTTGTCC CTAGCACTAA AA - #TAGGGGCG     19500     - TTTTTATCTT TTACTTGTCG CTTGATCGCT AAAATGCTTG GCTCAATCAC GC - #CCACAATA     19560     - GGGATTTTGG AATGCTTTTG CATCTCTTCT AAAGCTAGAG CGCTCGCTGT GT - #TGCATGCC     19620     - ACAATCAATA ATTCAATCTG GTGCGGTTTG AAAAAATCCA AAGCCTCTAA GC - #CAAATTGC     19680     - TTGATCGTAG TGGGGTCTTT AGTGCCATAA GGCACTCTAG CCGTATCGCC AT - #AATAGATG     19740     - ATTTCATCAA ATAATTGCGC TTTTAAAAGG CTTTTTAAAA CGCTAAACCC TC - #CCACACCG     19800     - CTATCAAAAA CGCCTATTTT CATGACACTT TTTTAATTTA ATGGGATTAA TT - #AGGGATTT     19860     - TATTTTTCAT TCATTAAGTT TAAAAATTCT TCATTGTCCT TAGTTTGTTG CA - #TTTTAGAA     19920     #    19932     - (2) INFORMATION FOR SEQ ID NO:26:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 396 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:26:(xi) SEQUENCE DESCRIPTION: SEQ     -      Leu Leu Phe Arg Leu Pro Thr Lys - # Ala Ile Arg Lys Arg Leu Lys     Glu     #   15     -      Thr Tyr Phe Thr Met Gln Lys Val - # Leu Thr Gln Ile Lys His Gln     Glu     #                 30     -      Lys Tyr His Tyr Leu Asn Glu Cys - # Asn Ser Gln Ser Leu Gln Met     Ala     #             45     -      Leu Arg Gln Leu Val Ser Ala Tyr - # Asp Asn Phe Phe Ser Lys Arg     Ala     #         60     -      Arg Tyr Pro Lys Phe Lys Ser Lys - # Lys Lys Ala Lys Gln Ser Phe     Ala     #     80     -      Ile Pro Gln Asn Ile Glu Ile Lys - # Thr Glu Thr Gln Thr Ile Ala     Leu     #   95     -      Pro Lys Phe Lys Glu Gly Ile Lys - # Ala Lys Leu His Arg Glu Leu     Pro     #                110     -      Lys Asp Ser Val Ile Lys Gln Ala - # Phe Ile Ser Cys Ile Ala Gly     Gln     #            125     -      Tyr Phe Cys Ser Ile Ser Tyr Glu - # Thr Lys Glu Pro Ile Pro Lys     Pro     #        140     -      Thr Ile Ile Lys Lys Ala Val Gly - # Leu Asp Met Gly Leu Arg Thr     Leu     #    160     -      Ile Val Thr Ser Asp Lys Ile Glu - # Tyr Pro His Ile Arg Phe Tyr     Gln     #   175     -      Lys Leu Glu Lys Lys Leu Thr Lys - # Ala Glu Arg Arg Leu Ser Lys     Lys     #                190     -      Val Lys Gly Ser Asn Asn Arg Lys - # Lys Gln Ala Lys Lys Val Ala     Arg     #            205     -      Leu His Leu Ala Cys Ser Asn Thr - # Arg Asp Asp Tyr Leu His Lys     Ile     #        220     -      Ser Asn Glu Ile Thr Asn Gln Tyr - # Asp Leu Ile Gly Val Glu Thr     Leu     #    240     -      Asn Val Lys Gly Leu Met Arg Thr - # Tyr His Ser Lys Ser Leu Ala     Asn     #   255     -      Ala Ser Trp Gly Lys Phe Leu Thr - # Met Leu Lys Tyr Lys Ala Gln     Arg     #                270     -      Lys Ala Lys Thr Leu Leu Gly Ile - # Asp Arg Phe Phe Pro Ser Ser     Gln     #            285     -      Leu Cys Ser Tyr Cys Gly Phe Asn - # Thr Gly Lys Lys His Glu Asn     Ile     #        300     -      Thr Lys Phe Thr Cys Pro His Cys - # Asn Ile Thr His His Arg Asp     Tyr     #    320     -      Asn Ala Ser Val Asn Ile Arg Asn - # Tyr Ala Leu Gly Met Leu Asp     Asp     #   335     -      Arg His Lys Ile Lys Ile Asp Lys - # Ser Arg Val Gly Ile Ile Arg     Thr     #                350     -      Asp Tyr Ala His Tyr Thr Asp Glu - # Arg Ile Lys Ala Cys Gly Ala     Ser     #            365     -      Ser Asn Gly Val Ile Ser Lys Tyr - # Gly Asn Ile Leu Asp Leu Ala     Ser     #        380     -      Tyr Gly Ala Met Lys Gln Glu Lys - # Ala Gln Ser Leu     #    395     - (2) INFORMATION FOR SEQ ID NO:27:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 124 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:27:(xi) SEQUENCE DESCRIPTION: SEQ     -      Glu Thr Leu Ile Leu Leu Ala Val - # Leu Ser Asp Leu Leu Leu Pro     Lys     #   15     -      Pro Leu Pro Thr Lys Ala Asp Pro - # Met Phe Leu Ile Leu Ser Asn     Pro     #                 30     -      Phe Phe Ser Thr Ser Ser Ala Ile - # Lys Phe Leu Ile Leu Ser Ile     Gly     #             45     -      Lys Leu Asn Phe Phe Pro Asn Ala - # Ser Leu Ser Pro Ile Ile Asn     Ile     #         60     -      Arg Lys Asn Lys Lys Phe Asn Ile - # Ile Glu Asn Lys Ser Leu Asp     Lys     #     80     -      Pro Val Lys Arg Phe Val Pro Pro - # Asn Lys Glu Ala Lys Ile Phe     Pro     #   95     -      Met Ile Ser Pro Phe Ile Phe Gly - # Cys Val Ser Ser Cys Leu Phe     Leu     #                110     -      Ser Leu Met Arg Val Leu Val Gly - # Ser Ile Ser Gln     #            120     - (2) INFORMATION FOR SEQ ID NO:28:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 382 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:28:(xi) SEQUENCE DESCRIPTION: SEQ     -      Gly Tyr Lys Asn Tyr Lys Met Leu - # Ala Lys Ile Val Phe Ser Ser     Leu     #   15     -      Val Ala Phe Gly Val Leu Ser Ala - # Asn Val Glu Gln Phe Gly Ser     Phe     #                 30     -      Phe Asn Glu Ile Lys Lys Glu Gln - # Glu Glu Val Ala Ala Lys Glu     Asp     #             45     -      Ala Leu Lys Ala Arg Lys Lys Leu - # Leu Asn Asn Thr His Asp Phe     Leu     #         60     -      Glu Asp Leu Ile Phe Arg Lys Gln - # Lys Ile Lys Glu Leu Met Asp     His     #     80     -      Arg Ala Lys Val Leu Ser Asp Leu - # Glu Asn Lys Tyr Lys Lys Glu     Lys     #   95     -      Glu Ala Leu Glu Lys Glu Thr Arg - # Gly Lys Ile Leu Thr Ala Lys     Ser     #                110     -      Lys Ala Tyr Gly Asp Leu Glu Gln - # Ala Leu Lys Asp Asn Pro Leu     Tyr     #            125     -      Lys Lys Leu Leu Pro Asn Pro Tyr - # Ala Tyr Val Leu Asn Gln Glu     Thr     #        140     -      Phe Thr Lys Glu Asp Lys Glu Arg - # Leu Ser Tyr Tyr Tyr Pro Gln     Val     #    160     -      Lys Thr Ser Ser Ile Phe Glu Lys - # Thr Thr Ala Thr Thr Lys Asp     Lys     #   175     -      Ala Gln Ala Leu Leu Gln Met Gly - # Val Phe Ser Leu Asp Glu Glu     Gln     #                190     -      Asn Lys Lys Ala Ser Arg Leu Ala - # Leu Ser Tyr Lys Gln Ala Ile     Glu     #            205     -      Glu Tyr Ser Asn Asn Ile Ser Asn - # Leu Leu Ser Arg Lys Glu Leu     Asp     #        220     -      Asn Ile Asp Tyr Tyr Leu Gln Leu - # Glu Arg Asn Lys Phe Asp Ser     Lys     #    240     -      Ala Lys Asp Ile Ala Gln Lys Ala - # Thr Asn Thr Leu Ile Phe Asn     Ser     #   255     -      Glu Arg Leu Ala Phe Ser Met Ala - # Ile Asp Lys Ile Asn Glu Lys     Tyr     #                270     -      Leu Lys Gly Tyr Glu Ala Phe Ser - # Asn Leu Leu Lys Asn Val Lys     Asp     #            285     -      Asp Val Glu Leu Asn Thr Leu Thr - # Lys Asn Phe Thr Asn Gln Lys     Leu     #        300     -      Ser Phe Ala Gln Lys Gln Lys Leu - # Cys Leu Leu Val Leu Asp Ser     Phe     #    320     -      Asn Phe Asp Thr Gln Ser Lys Lys - # Ser Ile Leu Lys Lys Thr Asn     Glu     #   335     -      Tyr Asn Ile Phe Val Asp Ser Asp - # Pro Met Met Ser Asp Lys Thr     Thr     #                350     -      Met Gln Lys Glu His Tyr Lys Ile - # Phe Asn Phe Phe Lys Thr Val     Val     #            365     -      Ser Ala Tyr Arg Asn Asn Val Ala - # Lys Asn Asn Pro Phe Glu     #        380     - (2) INFORMATION FOR SEQ ID NO:29:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 312 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:29:(xi) SEQUENCE DESCRIPTION: SEQ     -      Ile Gly Lys Glu Thr Leu Leu Lys - # Ser Ile Phe Lys Lys Leu Gly     Ser     #   15     -      Val Ala Leu Tyr Ser Leu Val Val - # Tyr Gly Gly Leu Asn Ala Ile     Asn     #                 30     -      Thr Ala Leu Leu Pro Ser Glu Tyr - # Lys Glu Leu Val Ala Leu Gly     Phe     #             45     -      Lys Lys Ile Lys Thr Leu Tyr Gln - # Arg His Asp Asp Lys Glu Ile     Thr     #         60     -      Lys Glu Glu Lys Glu Phe Ala Thr - # Asn Ala Leu Arg Glu Lys Leu     Arg     #     80     -      Asn Asp Arg Ala Arg Ala Glu Gln - # Ile Gln Lys Asn Ile Glu Ala     Phe     #   95     -      Glu Lys Lys Asn Asn Ser Ser Val - # Gln Lys Lys Ala Ala Lys His     Lys     #                110     -      Gly Leu Gln Glu Leu Asn Glu Ile - # Asn Ala Asn Pro Leu Asn Asp     Asn     #            125     -      Pro Asn Gly Asn Ser Ser Thr Glu - # Thr Lys Ser Asn Lys Asp Asp     Asn     #        140     -      Phe Asp Glu Met Ile Asn Lys Val - # Asn Glu Ser Phe Val Lys Pro     Ala     #    160     -      Ala Pro Leu Val Pro Asp Glu Trp - # Arg Thr Pro Glu Ile Glu Ile     Ile     #   175     -      Ile Asn Glu Cys Ile Ile Ser Ser - # Asn Asp Tyr Asp Gly Leu Arg     Lys     #                190     -      Cys Leu Ile Lys Asp Ile Lys Asp - # Gln Lys Ile Leu Ala Pro Leu     Leu     #            205     -      Glu Lys Ile Gln Glu Ile Glu Thr - # Glu Asn Asn Lys Phe Ser Arg     Gln     #        220     -      His Leu Ser Gly Leu Lys Leu Thr - # Leu Asn Asn Ser Asn Asn Arg     Thr     #    240     -      Phe Leu Ile Ala Ser Cys Ala Ile - # Cys Glu Lys Arg Lys Lys Glu     Met     #   255     -      Glu Gln Glu Asn Asn Tyr Gln Asp - # Thr Thr Asn Ala Ser Glu Phe     Gly     #                270     -      Thr Thr Asp Thr Lys Glu Asn Glu - # Ala Lys Asp Thr Ala Phe Ser     Asn     #            285     -      Asn Arg Ser Lys Ser Glu Leu Pro - # Asn Ser Val Ile Asn Gln Ile     Glu     #        300     -      Gln Ser Ile Ala His Gly Lys Lys     #    310     - (2) INFORMATION FOR SEQ ID NO:30:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 131 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:30:(xi) SEQUENCE DESCRIPTION: SEQ     -      Val Cys Tyr Cys Glu Met Leu Pro - # Thr Lys Thr Arg Ile Arg Asp     Lys     #   15     -      Asn Lys Gln Glu Leu Thr Gln Pro - # Lys Ile Lys Gly Leu Ile Met     Gly     #                 30     -      Lys Ile Leu Ala Ser Leu Leu Gly - # Gly Gly Thr Asn Leu Phe Thr     Gly     #             45     -      Leu Ser Ser Asp Leu Phe Ser Met - # Ile Leu Asn Phe Leu Phe Phe     Leu     #         60     -      Met Leu Met Met Gly Leu Asn Glu - # Ala Leu Gly Lys Lys Phe Asn     Leu     #     80     -      Pro Met Asp Met Ile Lys Asn Phe - # Met Ala Glu Val Leu Lys Asn     Gly     #   95     -      Phe Asp Ser Ile Lys Asn Met Gly - # Ser Ala Leu Val Gly Asn Gly     Phe     #                110     -      Gly Ser Ser Lys Ser Asp Lys Thr - # Ala Asn Lys Met Ser Val Ser     Gln     #            125     -      Val Arg Leu              130     - (2) INFORMATION FOR SEQ ID NO:31:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 136 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:31:(xi) SEQUENCE DESCRIPTION: SEQ     -      Asn Ile Tyr Tyr Phe Met Leu Leu - # Tyr Lys Phe Thr Ala Leu Asn     Tyr     #   15     -      Phe Phe Lys Asn Gln Asn Gly Thr - # Asn Gln Ile Ser Lys Leu Lys     Gln     #                 30     -      Asn Phe Leu Gln Phe Lys Tyr Ser - # Phe Asn Lys His Leu Asp Lys     Tyr     #             45     -      Ser Leu Tyr Tyr Arg Leu Phe Asn - # Ile Ser Ser Ile Val Ile Gly     Phe     #         60     -      Leu Ile Gly Leu Phe Ser Tyr Gly - # Ala Gly Val Ile Leu Val Tyr     Pro     #     80     -      Ile Leu Phe Leu Phe Ala Leu Ile - # Ile Lys Pro Ser Phe Phe Tyr     Tyr     #   95     -      Thr Thr Tyr Leu Leu Leu Leu Val - # Ser Leu Ser Ile Ile Ser Lys     Tyr     #                110     -      Tyr Leu Leu Ser His Ala Lys Phe - # Thr Met Lys Leu Ile Ile Leu     Met     #            125     -      Thr Gln Trp Gln Asn Trp Phe Leu     #        135     - (2) INFORMATION FOR SEQ ID NO:32:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 79 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:32:(xi) SEQUENCE DESCRIPTION: SEQ     -      Ile Lys Ser Arg Ser Tyr Ser Val - # Met Phe Phe Leu Leu Val Cys     Gly     #   15     -      Leu Leu Val Phe Phe Lys Phe Leu - # Leu Arg Leu Phe Leu Tyr Asn     Arg     #                 30     -      Phe Val Phe Phe Arg Trp Lys Thr - # Pro Leu Phe Phe Asn Arg Cys     Phe     #             45     -      Leu Phe Phe Val Trp His Lys Gln - # Thr Asn Arg Trp Phe Val Leu     Tyr     #         60     -      His Met Leu Val Ser Ala Ser Gly - # Val Phe Phe Glu Ile Trp Ser     #     75     - (2) INFORMATION FOR SEQ ID NO:33:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 241 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:33:(xi) SEQUENCE DESCRIPTION: SEQ     -      Ser Tyr Cys Gln Met Lys Thr Leu - # Val Lys Asn Thr Ile Ser Ser     Phe     #   15     -      Leu Leu Leu Ser Val Leu Met Gly - # Glu Asp Ile Thr Ser Gly Leu     Lys     #                 30     -      Gln Leu Asp Ser Thr Tyr Gln Glu - # Thr Asn Gln Gln Val Leu Lys     Asn     #             45     -      Leu Asp Glu Ile Phe Ser Thr Thr - # Ser Pro Ser Ala Asn Asn Glu     Ile     #         60     -      Gly Gln Glu Asp Ala Leu Asn Ile - # Lys Lys Ala Ala Ile Ala Leu     Arg     #     80     -      Gly Asp Leu Ala Leu Leu Lys Ala - # Asn Phe Glu Ala Asn Glu Leu     Phe     #   95     -      Phe Ile Ser Glu Asp Val Ile Phe - # Lys Thr Tyr Met Ser Ser Pro     Glu     #                110     -      Leu Leu Leu Thr Tyr Met Lys Ile - # Asn Pro Leu Asp Gln Asn Thr     Ala     #            125     -      Glu Gln Gln Cys Gly Ile Ser Asp - # Lys Val Leu Val Leu Tyr Cys     Glu     #        140     -      Gly Lys Leu Lys Ile Glu Gln Glu - # Lys Gln Asn Ile Arg Glu Arg     Leu     #    160     -      Glu Thr Ser Leu Lys Ala Tyr Gln - # Ser Asn Ile Gly Gly Thr Ala     Ser     #   175     -      Leu Ile Thr Ala Ser Gln Thr Leu - # Val Glu Ser Leu Lys Asn Lys     Asn     #                190     -      Phe Ile Lys Gly Ile Arg Lys Leu - # Met Leu Ala Gln Asn Lys Val     Phe     #            205     -      Leu Asn Tyr Leu Glu Glu Leu Asp - # Ala Leu Glu Arg Ser Leu Glu     Gln     #        220     -      Ser Lys Arg Gln Tyr Leu Gln Glu - # Arg Gln Ser Ser Lys Ile Ile     Val     #    240     -      Lys     - (2) INFORMATION FOR SEQ ID NO:34:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 131 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:34:(xi) SEQUENCE DESCRIPTION: SEQ     -      Phe Phe Phe Lys His Gly Arg Pro - # Phe Gly Ile Ile Glu Thr Phe     Thr     #   15     -      Leu Ala Pro Thr Lys Cys Pro Tyr - # Leu Asp Gly Leu Lys Ile Ser     Ala     #                 30     -      Cys Leu Met Glu Gln Val Ile Gln - # Asn Tyr Arg Met Ile Val Ala     Leu     #             45     -      Ile Gln Asn Lys Leu Ser Asp Ala - # Asp Phe Gln Asn Ile Ala Tyr     Leu     #         60     -      Asn Gly Ile Asn Gly Glu Ile Lys - # Thr Leu Lys Gly Ser Val Asp     Leu     #     80     -      Asn Ala Leu Ile Glu Val Ala Ile - # Leu Asn Ala Glu Asn His Leu     Asn     #   95     -      Tyr Ile Glu Asn Leu Glu Lys Lys - # Ala Asp Leu Trp Glu Glu Gln     Leu     #                110     -      Lys Leu Glu Arg Glu Thr Thr Ala - # Arg Asn Ile Ala Ser Ser Lys     Val     #            125     -      Ile Val Lys              130     - (2) INFORMATION FOR SEQ ID NO:35:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 170 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:35:(xi) SEQUENCE DESCRIPTION: SEQ     -      Gln Gly Gly Phe Leu Gln Val Gln - # Thr Ser Cys Leu Ala Ser Trp     Lys     #   15     -      Gly Ile Gln Ala Ala Leu Ser Ala - # Leu Gly Gly Asn Val Lys Met     Ile     #                 30     -      Val Glu Lys Gln Lys Ile Asn Thr - # Gln Thr Glu Ile Gln Asn Met     Gln     #             45     -      Ile Ala Leu Gln Lys Asn Asn Glu - # Met Ile Lys Leu Lys Met Asn     Gln     #         60     -      Gln Asn Ala Leu Leu Glu Ala Leu - # Lys Asn Ser Phe Glu Pro Arg     Val     #     80     -      Thr Leu Lys Thr Gln Met Glu Ile - # Ser Gln Ala Leu Gly Ser Ser     Ser     #   95     -      Asp Asn Ala Gln Tyr Ile Ala Tyr - # Asn Thr Ile Gly Ile Lys Ala     Phe     #                110     -      Glu Glu Thr Leu Lys Gly Phe Glu - # Thr Trp Leu Lys Thr Ala Met     Gln     #            125     -      Lys Ala Thr Leu Ile Asp Tyr Asn - # Ser Leu Thr Gly Gln Ala Leu     Phe     #        140     -      Gln Ser Ala Ile Tyr Ala Pro Ala - # Leu Ser Phe Phe Ser Ser Met     Gly     #    160     -      Asp His Leu Glu Ser Leu Lys His - # Ser Leu     #   170     - (2) INFORMATION FOR SEQ ID NO:36:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 122 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:36:(xi) SEQUENCE DESCRIPTION: SEQ     -      Ser Glu Glu Arg Asp Val Lys Cys - # Phe Leu Ser Ile Phe Ser Phe     Leu     #   15     -      Thr Phe Cys Gly Leu Ser Leu Asn - # Gly Thr Glu Val Val Ile Thr     Leu     #                 30     -      Glu Pro Ala Leu Lys Ala Ile Gln - # Ala Asp Ala Gln Ala Lys Gln     Lys     #             45     -      Thr Ala His Ala Glu Leu Lys Ala - # Ile Glu Ala Gln Ser Ser Ala     Lys     #         60     -      Glu Lys Ala Ile Gln Ala Gln Ile - # Glu Gly Glu Leu Arg Thr Gln     Leu     #     80     -      Ala Thr Met Ser Ala Met Leu Lys - # Gly Ala Asn Gly Val Ile Asn     Gly     #   95     -      Val Asn Gly Met Thr Gly Gly Phe - # Phe Ala Gly Ser Asp Ile Leu     Leu     #                110     -      Gly Val Met Glu Gly Tyr Ser Ser - # Gly Ala     #            120     - (2) INFORMATION FOR SEQ ID NO:37:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 143 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:37:(xi) SEQUENCE DESCRIPTION: SEQ     -      Arg Pro Met Ser Val Ser Met Pro - # Thr Val Tyr Ala Lys Tyr Gln     Ala     #   15     -      Leu Arg Thr Asn Ala Leu Thr Ser - # Gly Val Asn Pro Ile Thr Thr     Pro     #                 30     -      Ala Cys Pro Ile Gly Asp Lys Val - # Leu Ala Val Tyr Cys Tyr Ala     Glu     #             45     -      Lys Val Ala Glu Ile Leu Arg Glu - # Tyr Tyr Ile Glu Phe Val Lys     Asn     #         60     -      Asn Thr Asn Leu Leu Gln Asn Ala - # Ser Gln Met Ile Leu Asn Gln     Ser     #     80     -      Gly Leu Ala Thr Ser Thr Tyr Asp - # Thr Gln Ala Ile Ser Asn Ile     Ser     #   95     -      Ser Leu Tyr Asn Tyr Asn Ile Val - # Ala Asn Lys Ser Phe Leu Lys     Ser     #                110     -      His Leu Thr Tyr Leu Asp Tyr Ile - # Lys Asp Lys Leu Lys Gly Gln     Lys     #            125     -      Asp Ser Tyr Leu Thr Glu Arg Val - # Gln Thr Lys Ile Ile Val Lys     #        140     - (2) INFORMATION FOR SEQ ID NO:38:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 159 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:38:(xi) SEQUENCE DESCRIPTION: SEQ     -      Ile Ile Phe Cys Arg Leu Leu Ile - # Gly Ser Phe Leu Asn His Leu     Asn     #   15     -      Thr Ser Gly Val Ala Gly Pro Phe - # Ala Gly Ile Val Ala Gly Ala     Met     #                 30     -      Thr Ala Ala Ile Ile Pro Ile Ile - # Val Gly Phe Thr Asn Pro Gln     Met     #             45     -      Thr Ala Ile Met Thr Gln Tyr Asn - # Gln Ser Ile Ala Glu Ala Val     Ser     #         60     -      Met Pro Met Lys Ala Ala Asn Gln - # Gln Tyr Asn Gln Leu Tyr Gln     Gly     #     80     -      Phe Asn Asp Gln Ser Met Ala Val - # Gly Asn Asn Ile Leu Asn Ile     Ser     #   95     -      Lys Leu Thr Gly Glu Phe Asn Ala - # Gln Gly Asn Ser Gln Gly Ala     Gln     #                110     -      Ile Gly Ala Val Asn Ser Gln Ile - # Ala Ser Ile Leu Ala Ser Asn     Thr     #            125     -      Thr Pro Lys Asn Pro Ser Ala Ile - # Glu Ala Tyr Ala Thr Asn Gln     Ile     #        140     -      Ala Val Pro Ser Val Pro Thr Thr - # Val Glu Met Met Thr Val Tyr     #    155     - (2) INFORMATION FOR SEQ ID NO:39:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 140 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:39:(xi) SEQUENCE DESCRIPTION: SEQ     -      Lys Arg Ile Phe Ile Lys Leu Asn - # Tyr Tyr Leu Leu Gly Val Leu     Ser     #   15     -      Leu Ala Cys Leu Gln Ser Val Asn - # Ala Asp Gln Asn Thr Asp Ile     Lys     #                 30     -      Asp Ile Ser Pro Glu Asp Met Ala - # Leu Asn Ser Val Gly Leu Val     Ser     #             45     -      Arg Asp Gln Leu Lys Ile Glu Ile - # Pro Lys Glu Thr Leu Glu Gln     Lys     #         60     -      Val Ala Ile Leu Asn Asp Tyr Asn - # Asp Lys Asn Val Asn Ile Lys     Phe     #     80     -      Asp Asp Ile Ser Leu Gly Ser Phe - # Gln Pro Asn Asp Asn Leu Gly     Ile     #   95     -      Asn Ala Met Trp Gly Ile Gln Asn - # Leu Leu Met Ser Gln Met Met     Gly     #                110     -      Asp Tyr Gly Pro Asn Asn Pro Phe - # Met Tyr Gly Tyr Ala Pro Thr     Tyr     #            125     -      Ser Asp Ser Ser Phe Leu Pro Pro - # Ile Leu Gly Tyr     #        140     - (2) INFORMATION FOR SEQ ID NO:40:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 261 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:40:(xi) SEQUENCE DESCRIPTION: SEQ     -      Thr Lys Ile Ile Gly Asn Phe Glu - # Asn Asp Val Leu Thr Ile Leu     Asp     #   15     -      Ser Phe Ser Asn Tyr Leu Phe Glu - # Leu Lys Glu Glu Leu Asp Phe     Ile     #                 30     -      Glu Glu Glu Met Glu Gly Glu Ile - # Thr Glu Gln Asn Leu Thr Thr     Leu     #             45     -      Tyr Asp Phe Ser Asn Phe Leu Glu - # Asp His Val Asn Val Phe Tyr     Glu     #         60     -      Asn Val Leu Asn Ile Asp Asp Val - # Lys Thr Glu His Leu Tyr Ser     Gly     #     80     -      Leu Ile Asp Ser Leu Asn Ala Asn - # Leu His Phe Val Lys Ser Phe     Leu     #   95     -      Ser Asn Gln Asp Leu Asp Phe Arg - # Phe Phe Lys Glu Ile Asn Asp     Gly     #                110     -      Gln Asp Pro Gln Lys Thr Leu Ser - # Arg Leu Ile Pro Leu Gln Ser     Gly     #            125     -      Lys Asn Asp Ala Ser Ser Phe Lys - # Ala Asn Asn Ser Phe Val Ser     Leu     #        140     -      Val Tyr Val Tyr Ala Tyr Phe Met - # Leu Glu Thr Ile Met Gln Ser     Tyr     #    160     -      Arg Ile Leu Arg Leu Leu Glu Lys - # Pro Ile Asn Asn Asn Ile Ser     Glu     #   175     -      Asp Met Gln Ser Asp Ile Glu Ile - # Phe Phe Val Gln Ala Asn Phe     Leu     #                190     -      Glu Tyr Tyr Val Gln Asn Lys Ile - # Tyr Pro Lys Asn His Ala Tyr     Asp     #            205     -      Phe Met His Leu Ile Met Asp Ser - # Ile Ile Pro Asn Trp Ile Gln     Ile     #        220     -      Asp Met Ser Val Glu Ala Lys Lys - # Lys Glu Leu Phe Glu Lys Tyr     Phe     #    240     -      Gln Asn Ile Asp Glu Val Thr Asn - # Lys Met Leu Asp Gln Lys Asn     Gln     #   255     -      Asn Lys Asn Asn Asp                      260     - (2) INFORMATION FOR SEQ ID NO:41:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 263 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:41:(xi) SEQUENCE DESCRIPTION: SEQ     -      Tyr Ala Cys Ser Ala Val Thr Asn - # Leu Ile Thr Gly Asn Met Asn     Leu     #   15     -      Asp Tyr Pro Ile Thr Gln Leu Ile - # Asn Ala Phe Gly Lys Asp His     Asn     #                 30     -      Asp Pro Asn Gly Leu Val Ala Arg - # Leu Ala Pro Phe Cys Lys Ser     Thr     #             45     -      Asn Gly Glu Phe Gln Trp Leu Phe - # Asp Asn Lys Ala Thr Asp Arg     Leu     #         60     -      Asp Phe Ser Lys Thr Ile Ile Gly - # Val Asp Gly Ser Ser Phe Leu     Asp     #     80     -      Asn Asn Asp Val Ser Pro Phe Ile - # Cys Phe Tyr Leu Phe Ala Arg     Ile     #   95     -      Gln Glu Ala Met Asp Gly Arg Arg - # Phe Val Leu Asp Ile Asp Glu     Ala     #                110     -      Trp Lys Tyr Leu Gly Asp Pro Lys - # Val Ala Tyr Phe Val Arg Asp     Met     #            125     -      Leu Lys Thr Ala Arg Lys Arg Asn - # Ala Ile Val Arg Leu Ala Thr     Gln     #        140     -      Ser Ile Thr Asp Leu Leu Ala Cys - # Pro Ile Ala Asp Thr Ile Arg     Glu     #    160     -      Gln Cys Pro Thr Lys Ile Phe Leu - # Arg Asn Asp Gly Gly Asn Leu     Ser     #   175     -      Asp Tyr Gln Arg Leu Ala Asn Val - # Thr Glu Lys Glu Phe Glu Ile     Ile     #                190     -      Thr Lys Gly Leu Asp Arg Lys Ile - # Leu Tyr Lys Gln Asp Gly Ser     Pro     #            205     -      Ser Val Ile Ala Ser Phe Asn Leu - # Arg Gly Ile Pro Lys Glu Tyr     Leu     #        220     -      Lys Ile Leu Ser Thr Asp Thr Val - # Phe Val Lys Glu Ile Asp Lys     Ile     #    240     -      Ile Gln Asn His Ser Ile Ile Asp - # Lys Tyr Gln Ala Leu Arg Gln     Met     #   255     -      Tyr Gln Gln Ile Glu Glu Tyr                      260     - (2) INFORMATION FOR SEQ ID NO:42:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 683 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:42:(xi) SEQUENCE DESCRIPTION: SEQ     -      Ser Gly Ile Ser Ile Lys Arg Glu - # Val Phe Val Ala Ser Lys Gln     Ala     #   15     -      Asp Glu Gln Lys Lys Leu Ile Ile - # Glu Gln Glu Val Gln Lys Arg     Gln     #                 30     -      Phe Gln Lys Ile Glu Glu Leu Lys - # Ala Asp Met Gln Lys Gly Val     Asn     #             45     -      Pro Phe Phe Lys Val Leu Phe Asp - # Gly Gly Asn Arg Leu Phe Gly     Phe     #         60     -      Pro Glu Thr Phe Ile Tyr Ser Ser - # Ile Phe Ile Leu Phe Val Thr     Ile     #     80     -      Val Leu Ser Val Ile Leu Phe Gln - # Ala Tyr Glu Pro Val Leu Ile     Val     #   95     -      Ala Ile Val Ile Val Leu Val Ala - # Leu Gly Phe Lys Lys Asp Tyr     Arg     #                110     -      Leu Tyr Gln Arg Met Glu Arg Ala - # Met Lys Phe Lys Lys Pro Phe     Leu     #            125     -      Phe Lys Gly Val Lys Asn Lys Ala - # Phe Met Ser Ile Phe Ser Met     Lys     #        140     -      Pro Ser Lys Glu Met Ala Asn Asp - # Ile His Leu Asn Pro Asn Arg     Glu     #    160     -      Asp Arg Leu Val Ser Ala Ala Asn - # Ser Tyr Leu Ala Asn Asn Tyr     Glu     #   175     -      Cys Phe Leu Asp Asp Gly Val Ile - # Leu Thr Asn Asn Tyr Ser Leu     Leu     #                190     -      Gly Thr Ile Lys Leu Gly Gly Ile - # Asp Phe Leu Thr Thr Ser Lys     Lys     #            205     -      Asp Leu Ile Glu Leu His Ala Ser - # Ile Tyr Ser Val Phe Arg Asn     Phe     #        220     -      Val Thr Pro Glu Phe Lys Phe Tyr - # Phe His Thr Ile Lys Lys Lys     Ile     #    240     -      Val Ile Asp Glu Thr Asn Arg Asp - # Tyr Gly Leu Ile Phe Ser Asn     Asp     #   255     -      Phe Met Arg Ala Tyr Asn Glu Lys - # Gln Lys Arg Glu Ser Phe Tyr     Asp     #                270     -      Ile Ser Phe Phe Leu Thr Ile Glu - # Gln Asp Leu Leu Asp Thr Leu     Asn     #            285     -      Glu Pro Val Met Asn Lys Lys His - # Phe Ala Asp Asn Asn Phe Glu     Glu     #        300     -      Phe Gln Arg Ile Ile Arg Ala Lys - # Leu Glu Asn Phe Lys Asp Arg     Ile     #    320     -      Glu Leu Ile Glu Glu Leu Leu Ser - # Lys Tyr His Pro Thr Arg Leu     Lys     #   335     -      Glu Tyr Thr Lys Asp Gly Val Ile - # Tyr Ser Lys Gln Cys Glu Phe     Tyr     #                350     -      Asn Phe Leu Val Gly Met Asn Glu - # Ala Pro Phe Ile Cys Asn Arg     Lys     #            365     -      Asp Leu Tyr Leu Lys Glu Lys Met - # His Gly Gly Val Lys Glu Val     Tyr     #        380     -      Phe Ala Asn Lys His Gly Lys Ile - # Leu Asn Asp Asp Leu Ser Glu     Lys     #    400     -      Tyr Phe Ser Ala Ile Glu Ile Ser - # Glu Tyr Ala Pro Lys Ser Gln     Ser     #   415     -      Asp Leu Phe Asp Lys Ile Asn Ala - # Leu Asp Ser Glu Phe Ile Phe     Met     #                430     -      His Ala Tyr Ser Pro Lys Asn Ser - # Gln Val Leu Lys Asp Lys Leu     Ala     #            445     -      Phe Thr Ser Arg Arg Ile Ile Ile - # Ser Gly Gly Ser Lys Glu Gln     Gly     #        460     -      Met Thr Leu Gly Cys Leu Ser Glu - # Leu Val Gly Asn Gly Asp Ile     Thr     #    480     -      Leu Gly Ser Tyr Gly Asn Ser Leu - # Val Leu Phe Ala Asp Ser Phe     Glu     #   495     -      Lys Met Lys Gln Ser Val Lys Glu - # Cys Val Ser Ser Leu Asn Ala     Lys     #                510     -      Gly Phe Leu Ala Asn Ala Ala Thr - # Phe Ser Met Glu Asn Tyr Phe     Phe     #            525     -      Ala Lys His Cys Ser Phe Ile Thr - # Leu Pro Phe Ile Phe Asp Val     Thr     #        540     -      Ser Asn Asn Phe Ala Asp Phe Ile - # Ala Met Arg Ala Met Ser Phe     Asp     #    560     -      Gly Asn Gln Glu Asn Asn Ala Trp - # Gly Asn Ser Val Met Thr Leu     Lys     #   575     -      Ser Glu Ile Asn Ser Pro Phe Tyr - # Leu Asn Phe His Met Pro Thr     Asp     #                590     -      Phe Gly Ser Ala Ser Ala Gly His - # Ala Tyr Gly Asn Ile Val Ala     Met     #            605     -      Gly Gly Glu Tyr Val Lys Ile Glu - # Leu Gly Thr Asp Thr Gly Leu     Asn     #        620     -      Pro Phe Ala Trp Ala Ala Cys Val - # Gln Lys Thr Asn Ala Thr Met     Glu     #    640     -      Gln Lys Gln Thr Ala Ile Ser Val - # Val Lys Glu Leu Val Lys Asn     Leu     #   655     -      Ala Thr Lys Ser Asp Glu Lys Asp - # Glu Asn Gly Asn Ser Ile Ser     Phe     #                670     -      Ser Leu Ala Asp Ser Asn Thr Leu - # Ala Ala Gln     #            680     - (2) INFORMATION FOR SEQ ID NO:43:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 225 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:43:(xi) SEQUENCE DESCRIPTION: SEQ     -      Ala Leu Ser Leu Ala Ser Ile Leu - # Ala Arg Val Glu Glu Leu Ala     Lys     #   15     -      Leu Ile Asn Asn Asn Asn Asn Ser - # Asn Lys Lys Leu Arg Gly Phe     Phe     #                 30     -      Leu Lys Val Leu Leu Ser Leu Val - # Val Phe Ser Ser Tyr Gly Leu     Ala     #             45     -      Asn Asp Asp Lys Glu Ala Lys Lys - # Glu Ala Gln Glu Lys Glu Lys     Asn     #         60     -      Thr Pro Asn Gly Leu Val Tyr Thr - # Asn Leu Asp Phe Asp Ser Phe     Lys     #     80     -      Ala Thr Ile Lys Asn Leu Lys Asp - # Lys Lys Val Thr Phe Lys Glu     Val     #   95     -      Asn Pro Asp Ile Ile Lys Asp Glu - # Val Phe Asp Phe Val Ile Val     Asn     #                110     -      Arg Val Leu Lys Lys Ile Lys Asp - # Leu Lys His Tyr Asp Pro Val     Ile     #            125     -      Glu Lys Ile Phe Asp Glu Lys Gly - # Lys Glu Met Gly Leu Asn Val     Glu     #        140     -      Leu Gln Ile Asn Pro Glu Val Lys - # Asp Phe Phe Thr Phe Lys Ser     Ile     #    160     -      Ser Thr Thr Asn Lys Gln Arg Cys - # Phe Leu Ser Leu Arg Gly Glu     Thr     #   175     -      Arg Glu Ile Leu Cys Asp Asp Lys - # Leu Tyr Asn Val Leu Leu Ala     Val     #                190     -      Phe Asn Ser Tyr Asp Pro Asn Asp - # Leu Leu Lys His Ile Ser Thr     Val     #            205     -      Glu Ser Leu Lys Lys Ile Phe Tyr - # Thr Ile Thr Cys Glu Ala Val     Tyr     #        220     -      Leu     -      225     - (2) INFORMATION FOR SEQ ID NO:44:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 122 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:44:(xi) SEQUENCE DESCRIPTION: SEQ     -      Phe Gln Phe Glu Arg Lys Arg Met - # Lys Phe Phe Thr Arg Ile Thr     Asp     #   15     -      Ser Tyr Lys Lys Val Val Val Thr - # Leu Gly Leu Val Val Thr Thr     Asn     #                 30     -      Pro Leu Asn Ala Val Ala Ser Pro - # Thr Glu Gly Val Thr Ala Thr     Lys     #             45     -      Gly Leu Val Ile Gln Ile Ile Ser - # Val Leu Ala Ile Val Gly Gly     Cys     #         60     -      Ala Leu Gly Val Lys Gly Ile Ala - # Asp Ile Trp Lys Ile Ser Asp     Asp     #     80     -      Ile Lys Arg Gly Gln Ala Thr Val - # Phe Arg Tyr Ala Gln Pro Ile     Ala     #   95     -      Met Leu Ala Val Ala Gly Gly Ile - # Ile Tyr Leu Ser Thr Lys Phe     Gly     #                110     -      Phe Asn Ile Gly Glu Gly Gly Gly - # Ala Ser     #            120     - (2) INFORMATION FOR SEQ ID NO:45:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 87 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:45:(xi) SEQUENCE DESCRIPTION: SEQ     -      Ala Phe Val Ile His Lys Ile His - # Met Gly Asn Lys Met Glu Asn     Lys     #   15     -      Ser Ile Gly Gln Ile Phe Lys Asp - # Ser Leu Lys Lys Ser Phe Phe     Ser     #                 30     -      Gly Leu Trp Ser Cys Leu Lys Trp - # Ser Phe Ile Leu Thr Leu Ile     Ser     #             45     -      Leu Gly Leu Phe Leu Leu Val Phe - # Arg Phe Gln Pro Glu Thr Ile     Lys     #         60     -      Lys Tyr Ile Lys Asp Pro Lys Asp - # Leu Gln Phe Tyr Asn Asp Leu     Arg     #     80     -      Lys Lys Asn Gly Trp Asp Lys     #   85     - (2) INFORMATION FOR SEQ ID NO:46:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 178 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: peptide     #ID NO:46:(xi) SEQUENCE DESCRIPTION: SEQ     -      Lys Ser Val Met Lys Ile Gly Val - # Phe Asp Ser Gly Val Gly Gly     Phe     #   15     -      Ser Val Leu Lys Ser Leu Leu Lys - # Ala Gln Leu Phe Asp Glu Ile     Ile     #                 30     -      Tyr Tyr Gly Asp Thr Ala Arg Val - # Pro Tyr Gly Thr Lys Asp Pro     Thr     #             45     -      Thr Ile Lys Gln Phe Gly Leu Glu - # Ala Leu Asp Phe Phe Lys Pro     His     #         60     -      Gln Ile Glu Leu Leu Ile Val Ala - # Cys Asn Thr Ala Ser Ala Leu     Ala     #     80     -      Leu Glu Glu Met Gln Lys His Ser - # Lys Ile Pro Ile Val Gly Val     Ile     #   95     -      Glu Pro Ser Ile Leu Ala Ile Lys - # Arg Gln Val Lys Asp Lys Asn     Ala     #                110     -      Pro Ile Leu Val Leu Gly Thr Lys - # Ala Thr Ile Gln Ser Asn Ala     Tyr     #            125     -      Asp Asn Ala Leu Lys Gln Gln Gly - # Tyr Leu Asn Val Ser His Leu     Ala     #        140     -      Thr Ser Leu Phe Val Pro Leu Ile - # Glu Glu Ser Ile Leu Glu Gly     Glu     #    160     -      Leu Leu Glu Thr Cys Met Arg Tyr - # Tyr Phe Thr Pro Leu Lys Ile     Phe     #   175     -      Pro Lys     __________________________________________________________________________ 

What is claimed is:
 1. A composition comprising an isolated Helicobacter pylori CagI polynucleotide selected from the group consisting of SEQUENCE ID Nos. 9, 13, 17 and 21, or a polynucleotide encoding a polypeptide selected from the group consisting of SEQUENCE ID Nos. 3, 4, 10-12, 14-16, 18-20, 22-24, 26-40, and 45-46.
 2. A composition comprising a probe comprising the polynucleotide of claim
 1. 3. A diagnostic nucleic acid assay kit containing the probe of claim
 2. 4. A composition comprising a vector comprising the polynucleotide of claim
 1. 5. A composition comprising a host cell transformed with the vector of claim
 4. 6. A method for recombinately producing an H. pylori protein comprising culturing the host cell of claim 5 and isolating the recombinant polypeptide. 